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TITLE 

C/S-PRENYLTRANSFERASES FROM THE RUBBER-PRODUCING 
PLANTS RUSSIAN DANDELION (TARAXACUM KOK-SAGHYZ) AND 
SUNFLOWER (HELIANTHUS ANNUS) 
5 FIELD OF THE INVENTION 

This invention is in the field of plant molecular biology. More 
specifically, this invention pertains to the identification of c/s- 
prenyltransferase genes preferentially expressed in the rubber-producing 
plants Taraxacum kok-saghyz (russian dandelion) and Helianthus annus 
10 (sunflower) and their utility in altering natural rubber production in 
transgenic plants. 

BACKGROUND OF THE INVENTION , 
Natural rubber (c/s-1 ,4-polyisoprene) is produced in about 
2000 plant species (usually as a constituent of plant latex) with varying 
15 degrees of quality and quantity. Several well-studied examples of rubber- 
producing plants include: 

1. Indian laurel (Ficus elastics), a well-known household plant that 
produces a rubber-containing latex. 

2. Trees of the Sapotacae family (Palaquim gufta and P. 
20 oblongifolia), located in the Malaysian peninsula and 

responsible for gutta percha latex (a viscous, grayish latex that 
exudes slowly from cuts in the bark and rapidly turns brown 
after exposure to the air). 

3. The tropical American tree Mimusops balata, which produces 
25 Balata latex as white or reddish exudates. 

4. The tropical American saprodilla tree Archras zapote, which 
produces Chicle 

5. The Central American tree Castilla elastica, which produces 
caucho negro rubber. 

30 6. The Brazilian species, Manihot glazovii, which produces ceara 

rubber. 

7. The dandelion species kok-saghyz (Taraxacum kok-saghyz; 
from Kazakhstan) and krim-saghyz (T megalorhizon] found in 
the Crimea and throughout the Mediterranean region), which 

35 produce a high-quality rubber in their roots! 

8. The non-latex producing American desert shrub guayule 
(Parfhenium argentatum), in which rubber is produced 
seasonally within parenchymatous cells of the stem and root, 

1 
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and its isolation requires harvesting of the plant and maceration 
of the tissue. 

The natural rubbers produced by each of these species differ in one or 
more of their properties. In particular, differences in molecular weight and 
5 molecular weight distribution have been observed in natural rubbers 

depending on their plant origin (Backhaus, R.A. Israel Journal of Botany 
34:283-293(1985)). 

Natural rubber, despite the development of many synthetic polymer 
alternatives, remains a high-volume commodity material based on its 

10 supeior properties of elasticity, resilience, and resistence to high 
temperature. Currently, some 6,810,000 tons of natural rubber are 
produced annually. Despite this abundance, latex tapped from the tree 
Hevea brasiliensis is today the only significant commercial source of 
natural rubber and it is expected that global demand will soon be greater 

15 than supplies. Thus, there is significant interest in studying rubber 

biosynthesis and the differences between rubber produced by Hevea to 
other natural rubbers, in order to develop alternative rubber sources. In 
particular, it would be useful to industry to have available rubbers with 
different molecular weight averages (higher and lower than Hevea rubber) 

20 and distributions. For example, rubbers with molecular weights lower than 
those obtained from H. brasiliensis may have distinct advantages over the 
Hevea material in certain applications due to their ease of processing 
(Nor, H.M:, and Ebdon, J.R. Progress in Polymer Sci. 23: 143-177 (1998); 
Meeker, T. Low Molecular Weight Polyisoprenes Offer Versatility In 

25 Bonding Techniques. Adhesives Age; pp. 23-26 (July* 1998)). Although 
the molecular weights of rubbers synthesized in in vitro experiments with 
isolated, enzymatically-active rubber particles are highly influenced by the 
concentrations of initiator allylic diphosphate and isopentenyl diphosphate 
(IPP), the intrinsic properties of the c/s-prenyltransf erases themselves also 

30 play a role in determining the size of the rubber molecules they produce 
(Cornish, K. Phytochemistry 57: 1123-1134 (2001)). 

C/s-prenyltransferases are a family of enzymes that are responsible 
for synthesizing natural rubbers, by catalyzing the sequential addition of 
C 5 units (in the form of isopentenyl pyrophosphate (IPP)) to an initiator 

35 molecule in head-to-tail condensation reactions. The initiator molecules 
themselves are derived from isoprene units through the action of distinct 
prenyltransferases. These initiators are allylic terpenoid diphosphates 
such as dimethylallyldiphosphate (DMAPP; C 5 ), geranyl diphosphate 

2 
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(GPP; C 10 ), farnesyl diphosphate (FPP; C 15 ), and geranylgeranyl 
diphosphate (GGPP; C 2 o). Genes encoding the enzymes which 
synthesize these allylic terpenoid diphosphates have been cloned from a 
number of organisms, including plants, and all of these genes encode 
5 polypeptides with conserved regions of homology (McGaivey et ah, Plant 
Cell 7:1015-1026 (1995); Chappell, J., Annu. Rev. Plant Physiol. Plant 
MoL Biol. 46:521-547 (1995)). All of these gene products condense 
isoprene units in the trans- configuration. Prenyltransferases that 
condense isoprene units in a c/s-configuration have only recently been 

10 identified in microbes and plants. Most notable to the present disclosure 
herein is the discovery of c/s-prenyltransferase gene products in latex of 
the rubber-producing species Hevea brasiliensis (WO01/21650; GenBank 
Accession Numbers AY1 24934, AY1 24474, AY1 24473, AY1 24472, 
AY1 24471, AY1 24470, AY1 24469, AY1 24468, AY1 24467', AY1 24466, 

15 AY1 24465, AY1 24464; see also AB061236 and AB074307). 

In the present disclosure, the problem to be solved therefore is to 
identify new plant c/s-prenyltransferase genes. These genes will have 
utility in modification of the properties of natural rubbers obtained from 
plants. Applicants have solved the stated problem by identifying plant 

20 genes encoding c/s-prenyltransferases from rubber-producing russian 
dandelion and sunflower species (both of which produce natural rubbers 
with different properties than those obtained from H. brasiliensis). 
Additionally, Applicants have discovered diagnostic features within the 
gene sequences of c/s-prenyltransferases from rubber-producing species. 

25 SUMMARY OF THE INVENTION 

Accordingly the invention provides an isolated nucleic acid 
molecule encoding a c/s-prenyltransferase enzyme, selected from the 
group consisting of: 

a) an isolated nucleic acid molecule encoding the amino acid 
30 sequence as set forth in SEQ ID NOs:4 and 6; 

b) an isolated nucleic acid molecule that hybridizes with (a) 
under the following hybridization conditions: 0.1X SSC, 0.1% SDS, 
65°C and washed with 2X SSC, 0.1% SDS followed by 0.1 X SSC, 
0.1% SDS; or 

35 an isolated nucleic acid molecule that is complementary to (a) or 

(b). 

Specifically the invention provides an isolated nucleic acid 
molecule comprising a first nucleotide sequence encoding a polypeptide 

3 
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of at least 301 amino acids that has at least 70% identity based on the 
Smith-Waterman method of alignment when compared to a polypeptide 
having the sequence as set forth in SEQ ID NO:4 or a second nucleotide 
sequence comprising the complement of the first nucleotide sequence, 
5 wherein said enzyme has c/s-prenyltransferase activity. 

In similar fashion the invention provides An isolated nucleic acid 
molecule comprising a first nucleotide sequence encoding a polypeptide 
of at least 168 amino acids that has at least 70% identity based on the 
Smith-Waterman method of alignment when compared to a polypeptide 
10 having the sequence as set forth in SEQ ID NO:6 or a second nucleotide 
sequence comprising the complement of the first nucleotide sequence, 
wherein said enzyme has c/s-prenyltransferase activity. 

Additionally the invention provides polypeptides encoded by the 
isolated nucleic acid molecules of the invention as well as genetic chimera 
15 constructed therefrom and recombinant host cells containing and 
expressing the same. 

In another embodiment the invention provides a method of 
obtaining a nucleic acid molecule encoding a c/s-prenyltransferase 
- enzyme comprising: 
20 a) probing a genomic library with the nucleic acid molecule of 

the invention; 

b) identifying a DNA clone that hybridizes with the nucleic acid 
molecule of the invention; 

c) sequencing the genomic fragment that comprises the clone 
25 identified in step (b), 

wherein the sequenced genomic fragment encodes a c/s- 
prenyltransferase enzyme. 

In similar fashion the invention provides a method of obtaining a 
nucleic acid molecule encoding a c/s-prenyltransferase enzyme 
30 comprising: 

a) synthesizing at least one oligonucleotide primer 
corresponding to a portion of the sequence selected from the group 
consisting of SEQ ID NOs:3 and 5; and 

b) amplifying an insert present in a cloning vector using the 
35 oligonucleotide primer of step (a); 

wherein the amplified insert encodes a portion of an amino acid 
sequence encoding a c/s-prenyltransferase enzyme. 
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In a preferred embodiment the invention provides a method of 
altering the level of expression of a plant c/s-prenyltransferase protein in a 
host cell comprising: 

(a) transforming a host cell with the chimeric gene of the 
5 invention and; 

(b) growing the transformed host cell produced in step (a) under 
conditions that are suitable for expression of the chimeric gene 
resulting in production of altered levels of a plant c/s-prenyl- 
transferase protein in the transformed host cell relative to 

10 expression levels of an untransformed host cell. 

In a preferred embodiment the invention provides amethod for the 
production of natural rubber compounds comprising: 

a) providing a transformed host cell comprising: 

(i) suitable levels of isopentenyl pyrophate; and 
15 (ii) a c/s-prenyltransferase gene selected from the group 

consisting of SEQ ID NOs: 3 and 5, wherein said genes are 
operably linked to suitable regulatory sequences; and 

b) growing the transformed host cell of (a) under conditions 
whereby a natural rubber compound is produced. 

20 Similarly the invention provides a method for the identification of a 

polypeptide having c/s-prenyltransferase activity in a rubber-producing 
plant comprising: 

(a) obtaining the amino acid sequence of a polypeptide 
suspected of having c/s-prenyltransferase activity; and 
25 (b) aligning the amino acid sequence of step (a) with the amino 

acid sequence of a c/s-prenyltransferase consensus sequence 
selected from the group consisting of SEQ ID NO:4, 6, 8, 9, and 10, 
wherein the alignment shows the presence of conserved domains I, 
IV, and V (SEQ ID NOs: 38-40). 
30 In an alternate embodiment the invention provides a method for the 

identification of a polypeptide having c/s-prenyltransferase activity in a 
^rubber-producing plant comprising: 

(a) obtaining the amino acid sequence of a polypeptide 
suspected of having c/s-prenyltransferase activity; and 
35 (b) aligning the amino acid sequence of step (a) with the amino 

acid sequence of a c/s-prenyltransferase consensus sequence 
selected from the group consisting of SEQ ID NO:4, 6, 8, 9, and 10, 
wherein the alignment shows a sequence of at least about 50 non- 
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conserved amino acids present between the absolutely conserved 
tyrosine of Domain IV and the first of the absolutely conserved 
arginine residue of Domain V. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 AND SEQUENCE DESCRIPTIONS 

Figure 1 shows an alignment of the regions between Domains IV 
and V of c/s-prenyltransferases from rubber-producing plants (i.e., russian 
dandelion, sunflower and Hevea) and non-rubber-producing plants and 
microbes. 

10 Figure 2 shows the analysis of expression of the russian dandelion 

c/s-prenyltransferase gene by Northern blotting. 

The invention can be more fully understood from the following 
detailed description and the accompanying sequence descriptions, which 
form a part of this application. 

15 The following sequences comply with 37 C.F.R. 1.821-1.825 

("Requirements for Patent Applications Containing Nucleotide Sequences 
and/or Amino Acid Sequence Disclosures - the Sequence Rules") and are 
consistent with World Intellectual Property Organization (WIPO) Standard 
ST.25 (1998) and the sequence listing requirements of the EPO and PCT 

20 (Rules 5.2 and 49.5(a-bis), and Section 208 and Annex C of the 

Administrative Instructions). The symbols and format used for nucleotide 
and amino acid sequence data comply with the rules set forth in 
37 C.F.R. §1.822. 

SEQ ID NOs:1-34, 38-40 and 45 are genes or proteins as identified 

25 in Table 1 . 



Table 1 

Summary of Gene and Protein SEQ ID Numbers 



Clone ID number and 
Description 


Organism 


SEQ ID 
Nucleic 
acid 


SEQ ID 
Peptide 


EST etk1c.pk006.a10 


Taraxacum kok-saghyz 
(russian dandelion) 


1 




5'RACE product #3-4 


Taraxacum kok-saghyz 
(russian dandelion) 


2 
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Clone ID number and 
Description 


Organism 


SEQ ID 
Nucleic 
acid 


SEQ ID 
Peptide 


full-length nucleotide 
sequence for cis- 
prenyltransferase 
(assembled from SEQ 
ID NO: 1 and SEQ ID 
NO: 2) 


Taraxacum kok-saghyz 
(russian dandelion) 

- 


3 


4 j 


hls1c.pk020.m9 


Helianthus annus 
(sunflower) 


5 


6 


ecs1c.pk009.p19 


Calendula officinalis 
(pot marigold) 


— 


7 


ehb2c.pk001.i10 


Hevea brasiliensis 


— 


8 


ehb2c.pk001.d17 


Hevea brasiliensis 


— 


9 


ehb2c.pk001.o18 


Hevea brasiliensis 




10 


vdb1c.pk001.k23 


Vitis sp. (grape) 


— 


11 


r10n.pk117.i23 


Oryza sativa (rice) 




12 


rr1.pk0050.h8 


Oryza sativa (rice) 


— 


13 


sl1.pk0128.h7 


Glycine max (soybean) 




14 


Wdk5c.pk005.f22 


Tnticum aestivum 
(wheat) 


— 


15 


ecs1c.pk009.p19 


Dimorphotheca sinuata 
(african daisy) 


- — 


16 


bacterial undecaprenyl 
diphosphate synthase 


Micrococcus luteus 


17 


18 


undecaprenyl 
phosphate synthase 


Saccharomyces 
cerevisiae, strain rer2 


19 


20 


undecaprenyl 
phosphate synthase 


Saccharomyces 
cerevisiae, strain srt1 


21 


22 


MUF9.18 


Arabidopsis (Genbank 
Accession No. 
NM_1 25443) 




23 


MJB20.13 


Arabidopsis (Genbank 
Accession No. 
NM_1 27311) 




24 


F26B6.6 


Arabidopsis (Genbank 
Accession No. 
NM_1 27905) 




25 


MZN1.22 


Arabidopsis (Genbank 
Accession No. 
NM_1 25267) 




26 


conserved Domain IV 


alignment consensus 
sequence 




27 | 
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Clone ID number and 
Description 


Organism 


SEQ ID 
Nucleic 
acid 


SEQ ID 
Peptide 


conserved Domain V 


alignment consensus 
sequence 




28 


conserved Domain 1 


consensus sequence 
from Apfel et al. (J. Bact. 
182(2):483-492 (1999)) 


— 


. 29 


conserved Domain li 


consensus sequence 
from Apfel et al. (supra) 




30 


conserved Domain III 


consensus sequence 
from Apfel et al. (supra) 




31 


conserved Domain IV 


consensus sequence 
from Apfel et al. (supra) 




32 


conserved Domain V 


consensus sequence 
from Apfel et al. (supra) 


— 


33 


Conserved Domain V 


Consensus sequence 
from Taraxacum kok- 

saghyz 
(russian dandelion) and 
Helianthus annus 
(sunflower) ESTs 




34 


conserved Domain I 


consensus sequence 
in rubber-producing 
species 


• 


38 


conserved Domain IV 


consensus sequence 
in rubber-producing 
species 




39 


conserved Domain V 


consensus sequence 
in rubber-producing 
species 




40 


Clone #4-4 
(RTPCR product) 


Taraxacum kok-saghyz 
(russian dandelion) latex 




45 



SEQ ID NOs:41 and 42 are the primers Dan5 and Dan6. 

SEQ ID NOs: 36, 37, and 44 are the primers NKH46, NKH45, and 

NKH5. 

5 SEQ ID NO:43 is the primer DegHptS. 

SEQ ID NO:35 is the peptide 'ELVISLIVES'. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention reports the isolation and characterization of 
cDNAs corresponding to c/s-prenyltransferases from russian dandelion 
10 and sunflower. Applications for these genes include the development of 

8 
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novel plant phenotypes possessing greater plant defense responses, crop 
production, and/or creation of industrial sources of polyisoprenoids 
(including natural rubber). Furthermore,* the presentinvention provides a 
technique for readily identifying other c/s-prenyltransferase genes from 
5 rubber-producing plants. 
Definitions 

The following definitions are provided for the full understanding of 
terms and abbreviations used in this specification: 

"Polymerase chain reaction" is abbreviated PCR. 
10 "Open reading frame" is abbreviated ORF. 

"Expressed sequence tag" is abbreviated EST. 
"SDS polyacrylamide gel electrophoresis" is abbreviated 
SDS-PAGE. 

"UPPS" is the abbreviation for the specific undecaprenyl 
15 diphosphate synthases isolated from bacteria. 

"Dimethyl allyl diphosphate" is abbreviated DMAPP. 
"Isopentenyl diphosphate" is abbreviated I PP. 
"Geranyl diphosphate" is abbreviated GPP. 
"Farnesyl diphosphate" is abbreviated FPP. 
20 "Geranylgeranyl diphosphate" is abbreviated GGPP. 

"Polyisoprenoids" refer to a variety of hydrocarbons produced by 
plants that are built up of isoprene units (C 5 H 8 ) (Tanaka, Y. In Rubber and 
Related Polyprenols. Methods in Plant Biochemistry, Dey, P. M. and 
Harborne, J. B., Eds., Academic Press: San Diego, 1991; Vol. 7, 
25 pp 51 9-536). Those with 45 to 1 15 carbon atoms and varying numbers of 
c/s- and trans- (Z- and E-) double bonds are termed "polyprenols", while 
those polyisoprenoids of longer chain length are termed natural "rubbers" 
(Tanaka, Y. In Minor Classes of Terpenoids. Methods in Plant 
Biochemistry] Dey, P. M. and Harborne, J. B., Eds., Academic: San Diego, 
30 1991; Vol. 7, pp 537-542). There are several suggested functions for 
plant polyisoprenoids. For example, terpenoid quinones are most likely 
involved in photophosphorylation and respiratory chain phosphorylation, 
while rubbers have been implicated in plant defense against herbivory, by 
possibly serving to repel and entrap insects and seal wounds in a manner 
35 analogous to plant resins. The specific roles of the C45-C-115 polyprenols, 
however, remain unidentified (although as with most secondary 
metabolites they too most likely function in plant defense). Short-chain 
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polyprenols may also be involved in protein glycosylation in plants, by 
analogy with the role of dolichols in animal metabolism. 

The term "rubber" encompasses any material that is highly elastic; 
i.e., the elastic material can be stretched without breaking and will return 
5 to its original length on removal of the stretching force. "Natural rubbers" 
are those rubbers produced by plant species, often (though not always) as 
a constituent of latex. 

The term "plant latex" refers to a milky fluid present in lacticifers, or 
latex ducts, which seeps out of the plant upon wounding. 
10 The term "c/s-prenyltransferase" refers generally to a class of 

enzymes capable of catalyzing the sequential addition of C 5 units to 
polyprenols and rubbers in cis 1-4 orientation. Two examples of 
c/s-prenyltransferases are the undecaprenyl diphosphate and 
dehydrodolichyl diphosphate synthase. 
15 The term "initiator molecules" or "initiators" refers to allylic 

terpenoid diphosphates that are derived from isoprene units (IPP) through 
the action of prenyltransferases. Examples of common initiators include: 
dimethylallyldiphosphate (DMAPP), a C 5 compound; geranyl diphosphate 
(GPP), a C 10 compound; farnesyl diphosphate (FPP), a C 15 compound; 
20 and, geranylgeranyl diphosphate (GGPP), a C 20 compound. 

The term "plant defense response" refers to the ability of a plant to 
deter tissue damage by insects, pathogens (e.g., fungi, bacteria or 
viruses), and/or herbivores. 

As used herein, an "isolated nucleic acid fragment" is a polymer of 
25 RNA or DNA that is single- or double-stranded, optionally containing 
synthetic, non-natural or altered nucleotide bases. An isolated nucleic 
acid fragment in the form of a polymer of DNA may be comprised of one 
or more segments of cDNA, genomic DNA or synthetic DNA. 

The term "fragment" refers to a DNA or amino acid sequence 
30 comprising a subsequence of the nucleic acid sequence or protein of the 
present invention. However, an active fragment of the present invention 
comprises a sufficient portion of the protein to maintain activity. 

A nucleic acid molecule is "hybridizable" to another nucleic acid 
molecule, such as a cDNA, genomic DNA, or RNA molecule, when a 
35 single stranded form of the nucleic acid molecule can anneal to the other 
nucleic acid molecule under the appropriate conditions of temperature and 
solution ionic strength. Hybridization and washing conditions are well 
known and exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T. 

10 
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(Molecular Cloning: A Laboratory Manual 2" ed.; Cold Spring Harbor 
Laboratory: Cold Spring Harbor, NY, 1989 (hereinafter "Maniatis"), 
particularly Chapter 11 and Table 11.1 therein (entirely incorporated 
herein by reference). The conditions of temperature and ionic strength 
5 determine the "stringency" of the hybridization. Stringency conditions can 
be adjusted to screen for moderately similar fragments (such as 
homologous sequences from distantly related organisms), to highly similar 
fragments (such as genes that duplicate functional enzymes from closely 
related organisms). Post-hybridization washes determine stringency 

10 conditions. One set of preferred conditions uses a series of washes 
starting with 6X SSC, 0.5% SDS at room temperature for 15 min, then 
repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, and then repeated 
twice with 0.2X SSC, 0.5% SDS at 50°C for 30 min. A more preferred set. 
of stringent conditions uses higher temperatures in which the washes are 

15 identical to those above except for the temperature of the final two 30 min 
washes in 0.2X SSC, 0.5% SDS was increased to 60°C. Another 
preferred set of highly stringent conditions uses two final washes in 0.1 X 
SSC, 0.1% SDS at 65°C. An additional set of stringent conditions include 
hybridization at 0.1 X SSC, 0.1% SDS, 65°C and washed with 2X SSC, 

20 0.1% SDS followed by 0.1X SSC, 0.1% SDS, for example. 

Hybridization requires that the two nucleic acids contain 
complementary sequences, although depending on the stringency of the 
hybridization, mismatches between bases are possible. The appropriate 
stringency for hybridizing nucleic acids depends on the length of the 

25 nucleic acids and the degree of complementation, variables well known in 
the art. The greater the degree of similarity or homology between two 
nucleotide sequences, the greater the value of Tm for hybrids of nucleic 
acids having those sequences. The relative stability (corresponding to 
higher Tm) of nucleic acid hybridizations decreases in the following order: 

30 RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater than 

100 nucleotides in length, equations for calculating Tm have been derived 
(see Maniatus, supra, 9.50-9.51). For hybridizations with shorter nucleic 
acids, i.e., oligonucleotides, the position of mismatches becomes more 
important, and the length of the oligonucleotide determines its specificity 

35 (see Maniatus, supra, 1 1 .7-1 1.8). In one embodiment the length for a 
hybridizable nucleic acid is at least about 10 nucleotides. Preferably a 
minimum length for a hybridizable nucleic acid is at least about 
15 nucleotides; more preferably at least about 20 nucleotides; and most 
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preferably the length is at least 30 nucleotides. Furthermore, the skilled 
artisan will recognize that the temperature and wash solution salt 
concentration may be adjusted as necessary according to factors such as 
length of the probe. 

A "substantial portion" of an amino acid or nucleotide sequence is 
that portion comprising enough of the amino acid sequence of a 
polypeptide or the nucleotide sequence of a gene to putatively identify that 
polypeptide or gene, either by manual evaluation of the sequence by one 
skilled in the art, or by computer- automated sequence comparison and 
identification using algorithms such as BLAST (Basic Local Alignment 
Search Tool; Altschul, S. F., et al., J. Mol. Biol. 215:403-410 (1993); see 
also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence often or 
more contiguous amino acids or thirty or more nucleotides is necessary in 
order to putatively identify a polypeptide or nucleic acid sequence as 
homologous to a known protein or gene. Moreover, with respect to 
nucleotide sequences, gene specific oligonucleotide probes comprising 
20-30 contiguous nucleotides may be used in sequence-dependent 
methods of gene identification (e.g., Southern hybridization) and isolation 
(e.g., in situ hybridization of bacterial colonies or bacteriophage plaques). 
In addition, short oligonucleotides of 12-15 bases may be used as 
amplification primers in PCR in order to obtain a particular nucleic acid 
fragment comprising the primers. Accordingly, a "substantial portion." of a 
nucleotide sequence comprises enough of the sequence to specifically 
identify and/or isolate a nucleic acid fragment comprising the sequence. 
The instant specification teaches partial or complete amino acid and 
nucleotide sequences encoding one or more particular plant proteins. The 
skilled artisan, having the benefit of the sequences as reported herein, 
may now use ail or a substantial portion of the disclosed sequences for 
purposes known to those skilled in this art. Accordingly, the instant 
invention comprises the complete sequences as reported in the 
accompanying Sequence Listing, as well as substantial portions of those 
sequences as defined above. 

The term "complementary" is used to describe the relationship 
between nucleotide bases that are capable of hybridizing to one another. 
For example, with respect to DNA, adenosine is complementary to 
thymine and cytosine is complementary to guanine. Accordingly, the 
instant invention also includes isolated nucleic acid fragments that are 
complementary to the complete sequences as reported in the 
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accompanying Sequence Listing as well as those substantially similar 
nucleic acid sequences. 

The term "percent identity", as known in the art, is a relationship 
between two or more polypeptide sequences or two or more 
5 polynucleotide sequences, as determined by comparing the sequences. 
In the art, "identity" also means the degree of sequence relatedness 
between polypeptide or polynucleotide sequences, as the case may be, as 
determined by the match between* strings of such sequences. "Identity" 
and "similarity" can be readily calculated by known methods, including (but 

10 not limited to) those described in: 1 .) Computational Molecular Biology : 
Lesk, A. M., Ed.; Oxford University: NY, 1988; 2.) Biocomputina: 
Informatics and Genome Projects: Smith. D. W., Ed.; Academic: NY, 
1993; 3.) Computer Analysis of Sequence Data. Part I: Griffin. A. M., and 
Griffin, H. G., Eds.; Humana: NJ, 1994; 4.) Sequence Analysis in 

15 Molecular Biology: von Heinje, G., Ed.; Academic, 1987; and 

5.) Sequence Analysis Primer: Gribskov, M. and Devereux, J., Eds.; 
Stockton: NY, 1 991 . Preferred methods to determine identity are 
designed to give the best match between the sequences tested. Methods 
to determine identity and similarity are codified in publicly available 

20 computer programs. Sequence alignments and percent identity 

calculations may be performed using the AlignX program of the Vector 
NTI bioinformatics computing suite (InforMax Inc., North Bethesda, MD). 
Multiple alignment of the sequences was performed using the Clustal 
method of alignment (Higgins and Sharp CABIOS. 5:151-153 (1989)) with 

25 the default parameters (GAP OPENING PENALTY=10, GAP EXTENSION 
PENALTY=0.1). Default parameters for pairwise alignments using the 
Clustal method were KTUPLE SIZE=1, GAP PENALTY=3, WINDOW 
SIZE=5 and NUMBER OF BEST DIAGONALS=5. 

Suitable nucleic acid fragments (isolated polynucleotides of the 

30 present invention) encode polypeptides that are at least about 70% 
identical, preferably at least about 80% identical to the amino acid 
sequences reported herein. Preferred nucleic acid fragments encode 
amino acid sequences that are about 85% identical to the amino acid 
sequences reported herein. More preferred nucleic acid fragments 

35 encode amino acid sequences that are at least about 90% identical to the 
amino acid sequences reported herein. Most preferred are nucleic acid 
fragments that encode amino acid sequences that are at least about 95% 
identical to the amino acid sequences reported herein. Suitable nucleic - 
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acid fragments not only have the above homologies but typically encode a 
polypeptide having at least 50 amino acids, preferably at least 100 amino 
acids, more preferably at least 150 amino acids, still more preferably at 
least 200 amino acids, and most preferably at least 250 amino acids. 
5 "Codon degeneracy" refers to the divergence in the genetic code 

permitting variation of the nucleotide sequence without effecting the amino 
acid sequence of an encoded polypeptide. Accordingly, the instant 
invention relates to any nucleic acid fragment that encodes all or a 
substantial portion of the amino acid sequence encoding the instant plant 
10 polypeptides as set forth in SEQ ID NOs:4 and 6. The skilled artisan is 
well aware of the "codon-bias" exhibited by a specific host cell in usage of 
nucleotide codons to specify a given amino acid. Therefore, when 
synthesizing a gene for improved expression in a host cell, it is desirable 
to design the gene such that its frequency of codon usage approaches the 
15 frequency of preferred codon usage of the host cell. 

"Synthetic genes" can be assembled from oligonucleotide building 
blocks that are chemically synthesized using procedures known to those 
skilled in the art. These building blocks are ligated and annealed to form 
gene segments that are then enzymatically assembled to construct the 
20 entire gene. "Chemically synthesized", as related to a sequence of DNA, 
means that the component nucleotides were assembled in vitro. Manual 
chemical synthesis of DNA may be accomplished using well-established 
procedures, or automated chemical synthesis can be performed using one 
of a number of commercially available machines. Accordingly, the genes 
25 can be tailored for optimal gene expression based on optimization of 

nucleotide sequence to reflect the codon bias of the host cell. The skilled 
• artisan appreciates the likelihood of successful gene expression if codon 
usage is biased towards those codons favored by the host. Determination 
of preferred codons can be based on a survey of genes derived from the 
30 host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific 
protein, including regulatory sequences preceding (5' non-coding 
sequences) and following (3' non-coding sequences) the coding 
sequence. "Native gene" refers to a gene as found in nature with its own 
35 regulatory sequences. "Chimeric gene" refers to any gene that is not a 
native gene, comprising regulatory and coding sequences that are not 
found together in nature. Accordingly, a chimeric gene may comprise 
regulatory sequences and coding sequences that are derived from 
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different sources, or regulatory sequences and coding sequences derived 
from the same source, but arranged in a manner different than that found 
in nature. "Endogenous gene" refers to a native gene in its natural 
location in the genome of an organism. A "foreign" gene refers to a gene 

5 not normally found in the host organism, but that is introduced into the 
host organism by gene transfer. Foreign genes can comprise native 
genes inserted into a non-native organism, or chimeric genes. A 
"transgene" is a gene that has been introduced into the genome by a 
transformation procedure. 

10 "Coding sequence" refers to a DNA sequence that codes for a 

specific amino acid sequence. "Suitable regulatory sequences" refer to 
nucleotide sequences located upstream (5 1 non-coding sequences), 
within, or downstream (3* non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or 

15 translation of the associated coding sequence. Regulatory sequences 
may include promoters, translation leader sequences, introns, 
polyadenylation recognition sequences, RNA processing site, effector 
binding site and stem-loop structure. 

"Promoter" refers to a DNA sequence capable of controlling the 

20 expression of a coding sequence or functional RNA. In general, a coding 
sequence is located 3' to a promoter sequence. Promoters may be 
derived in their entirety from a native gene, or be composed of different 
elements derived from different promoters found in nature, or even 
comprise synthetic DNA segments. It is understood by those skilled in the 

25 art that different promoters may direct the expression of a gene in different 
tissues or cell types, or at different stages of development, or in response 
to different environmental or physiological conditions. Promoters which 
cause a gene to be expressed in most cell types at most times are 
commonly referred to as "constitutive promoters". It is further recognized 

30 that since in most cases the exact boundaries of regulatory sequences 
. have not been completely defined, DNA fragments of different lengths 
may have identical promoter activity. 

The "3* non-coding sequences" refer to DNA sequences located 
downstream of a coding sequence and include polyadenylation 

35 recognition sequences and other sequences encoding regulatory signals 
capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the'addition of 
polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of 
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different 3' non-coding sequences is exemplified by Ingelbrecht et al 
{Plant Cell 1 :671-680 (1 989)). 

"RNA transcript" refers to the product resulting from RNA 
polymerase-catalyzed transcription of a DNA sequence. When the RNA 
transcript is a perfect complementary copy of the DNA sequence, it is 
referred to as the primary transcript or it. may be a RNA sequence derived 
from post-transcriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the 
RNA that is without introns and that can be translated into protein by the 
cell. "cDNA" refers to a double-stranded DNA that is complementary to 
and derived from mRNA. "Sense" RNA refers to RNA transcript that 
includes the mRNA and so can be translated into protein by the cell 
"Antisense RNA" refers to a RNA transcript that is complementary to all or 
part of a target primary transcript or mRNA and that blocks the expression 
of a target gene (U.S. Patent No. 5,107,065; WO 9928508). The 
complementarity of an antisense RNA may be with any part of the specific 
gene transcript, i.e., at the 5' non-coding sequence, 3* non-coding 
sequence, or the coding sequence. "Functional RNA" refers to antisense 
RNA, ribozyme RNA, or other RNA that is not translated yet has an effect 
20 on cellular processes. 

The term "operably linked" refers to the association of nucleic acid 
sequences on a single nucleic acid fragment so that the function of one is 
affected by the other. For example, a promoter is operably linked with a 
coding sequence when it is capable of affecting the expression of that 
25 coding sequence (i.e., that the coding sequence is under the 

transcriptional control of the promoter). Coding sequences can be 
operably linked to regulatory sequences in sense or antisense orientation. 

The term "expression'', as used herein, refers to the transcription 
and stable accumulation of sense (mRNA) or antisense RNA derived from 
the nucleic acid fragment of the invention. Expression may also refer to 
translation of mRNA into a polypeptide. "Antisense inhibition" refers to the 
production of antisense RNA transcripts capable of suppressing the 
expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of 
35 production in normal or non-transformed organisms. "Co-suppression- 
refers to the production of sense RNA transcripts capable of suppressing 
the expression of identical or substantially similar foreign or endogenous 
genes (U.S. 5,231,020). 
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The term "altered biological activity" will refer to an activity, 
associated with a protein encoded by a microbial nucleotide sequence 
which can be measured by an assay method, where that activity is either 
greater than or less than the activity associated with the native microbial 
5 sequence. "Enhanced biological activity" refers to an altered activity that 
is greater than that associated with the native sequence. "Diminished 
biological activity" is an altered activity that is less than that associated 
with the native sequence. 



10 polypeptide; i.e., one from which any pre- or propeptides present in the 

primary translation product have been removed. "Precursor" protein refers 
to the primary product of translation of mRNA; i.e., with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited 
to intracellular localization signals. 

15 A "chloroplast transit peptide" is an amino acid sequence which is 

translated in conjunction with a protein and directs the protein to the 
chloroplast or other plastid types present in the cell in which the protein is 
made. "Chloroplast transit sequence" refers to a nucleotide sequence that 
encodes a chloroplast transit peptide. 

20 "Transformation" refers to the transfer of a nucleic acid fragment 

into the genome of a host organism, resulting in genetically stable 
inheritance. Host organisms containing the transformed nucleic acid 
fragments are referred to as "transgenic" or "recombinant" or 
"transformed" organisms. 

25 The terms "plasmid", "vector" and "cassette" refer to an extra 

chromosomal element often carrying genes which are not part of the 
central metabolism of the cell, and usually in the form of circular double- 
stranded DNA fragments. Such elements may be autonomously 
replicating sequences, genome integrating sequences, phage or 

30 nucleotide sequences, linear or circular, of a single- or double-stranded 
DNA or RNA, derived from any source, in which a number of nucleotide 
sequences have been joined or recombined into a unique construction 
which is capable of introducing a promoter fragment and DNA sequence 
for a selected gene product along with appropriate 3' untranslated 

35 sequence into a cell. "Transformation cassette" refers to a specific vector 
containing a foreign gene and having elements in addition to the foreign 
gene that facilitate transformation of a particular host cell. "Expression 
cassette" refers to a specific vector containing a foreign gene and having 



Mature" protein refers to a post-translationally processed 
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elements in addition to the foreign gene that allow for enhanced 
expression of that gene in a foreign host. 

The term "sequence analysis software" refers to any computer 
algorithm or software program that is useful for the analysis of nucleotide 
or amino acid sequences. "Sequence analysis software" may be 
commercially available or independently developed. Typical sequence 
analysis software will include but is not limited to the GCG suite of 
programs (Wisconsin Package Version 9.0, Genetics Computer Group 
(GCG), Madison, Wl), BLASTP, BLASTN, BLASTX (Altschul et al J Mol. 
Biol. 215:403-410 (1990), Vector NTI (inforMax Inc., North Bethesda MD) 
and DNASTAR (DNASTAR Inc., Madison, Wl). Within the context of this 
application it will be understood that where sequence analysis software is 
used for analysis, that the results of the analysis will be based on the 
"default values" of the program referenced, unless otherwise specified 
As used herein "default vales" will mean any set of values or parameters 
which originally load with the software when first initialized. 

The term "conserved domain" means a set of amino acids 
conserved at specific positions along an aligned sequence of 
evolutionary related proteins. While amino acids at other positions can 
vary between homologous proteins, amino acids that are highly conserved 
at specific positions indicate amino acids that are essential in the 
structure, the stability, or the activity of a protein. Because they are 
.dentified by their high degree of conservation in aligned sequences of a 
fam.ly of protein homologues.'they can be used as identifiers, or 
"signatures", to determine if a protein with a newly determined sequence 
belongs to a previously identified protein family. Conserved domains of 
are specifically described for the family of c/s-prenyltransferases, 
according to the work of Apfel, CM. et al. (J. Bact: 181(2)- 483-492 
(1999)). 

The term "non-conserved domain" means a set of amino acids 
present between conserved domains, which whilst the individual amino 
acids are not conserved at specific positions along an aligned sequence of 
evolut,onarily related proteins, is recognizable by its presence or absence 
in aligned sequences of evolutionary related proteins. The presence of 
such a domain, despite positional non-conservation among its constituent 
amino acids, indicates that the domain plays a role essential in the 
structure, the stability, or the activity of a protein, e.g., by increasing the 
distance between other (conserved) domains. Because they are identified 
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by their presence in aligned sequences of a family of protein homologues, 
they can be used as identifiers, or'signatures", to determine if a protein 
with a newly determined sequence belongs to a previously identified 
protein family or subfamily. In the present invention, non-conserved 
5 domains are specifically described for c/s-prenyltransferases from rubber- 
producing plants. 

Standard recombinant DNA and molecular cloning techniques used 
here are well known in the art and are described by: Sambrook, J., Fritsch, 
E. F. and Maniatis, T., Molecular Cloning: A Laboratory Manual ; 2nd ed.; 

10 Cold Spring Harbor Laboratory: Cold Spring Harbor, NY, 1989 (hereinafter 
"Maniatis"); and by Silhavy, T. J., Bennan, M. L and Enquist, L. W., 
Experiments with Gene Fusions : Cold Spring Harbor Laboratory: Cold 
Spring Harbor, NY, 1984; and by Ausubel, F. M. et al., Current Protocols 
in Molecular Biology; Greene Publishing Assoc. and Wiley-lnterscience 

15 (1987). 

C/s-Prenvltransferase Sequence Identification 

Novel nucleotide sequences have been isolated from the rubber- 
producing plants Taraxacum kok-saghyz (russian dandelion) and 
Helianthus annus (sunflower) encoding gene products involved in the 

20 production of natural rubbers. More specifically, these unique plant 

homologs of microbial c/s-prenyltransferase proteins are involved in the 
synthesis of poly c/s-isoprenoids. Classification of the proteins is based 
on alignments which reveal the presence of five conserved domains, 
indicative of a c/s-prenyltransferase, as described by Apfel et al. (J. Bad 

25 181(2): 483-492 (1999)). 

Comparison of the dandelion c/s-prenyltransferase nucleotide base 
and deduced amino acid sequences to public databases reveals that the 
most similar known sequences are about 50% identical to the amino acid 
sequence of SEQ ID NO:4 reported herein over a length of 301 amino 

30 acids using a Smith-Waterman alignment algorithm (W. R. Pearson, 

Comput Methods Genome Res., [Proc. Int. Symp.] (1994), Meeting Date 
1992, 111-20. Suhai, S., Ed.; Plenum: New York, NY). Strong correlation 
was seen between the instant sequences and the c/s-prenyltransferase 
genes and proteins isolated from Micrococcus luteus (SEQ ID NOs:17 and 

35 18, encoding undecaprenyl diphosphate synthase; Shimizu, N., et al., 
J. Biol. Chem. 273:19476-19481 (1998)) and Saccharomyces cerevisiae 
(SEQ ID NOs: 19-22). 
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In like manner, comparison of the sunflower c/s-prenyltransferase 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences are about 57% identical to 
the amino acid sequence of SEQ ID NO: 6 reported herein over a length 
5 of 168 ammo acids using a Smith-Waterman alignment algorithm. Again 
strong correlation was noted between the instant sequences and the ' 
c^prenyltransferase genes and proteins isolated from Micrococcus luteus 
(SEQ ID NOS.-17 and 18; Shimizu, N., et al., supra) and Saccharomyces 
cerevisiae (SEQ ID NOs: 1 9-22). 

' More preferred c/s-prenyltransferase amino acid fragments are at 

least about 70 % -80% identical to the sequences herein, where about 
80 /o-90% is preferred. Most preferred are nucleic acid fragments that are 
at least 95% identical to the amino acid fragments reported herein 

Similarly, preferred c/s-prenyltransferase encoding nucleic acid 
sequences corresponding to the instant ORFs are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences of reported herein. More preferred c/s-prenyltransferase 
nucleic acid fragments are at least 90% identical to the sequences herein 
Mos preferred are c/s-prenyltransferase nucleic acid fragments that are at 
least 95% identical to the nucleic acid fragments reported herein 
Isolation of Homolog y 

The nucleic acid fragments of the present invention may be used to 
isolate cDNAs and genes encoding homologous prenyltransferases from 
the same or other plant species or from microbial species. Isolating 
homologous genes using sequence-dependent protocols is well known in 
he art. Examples of sequence-dependent protocols include (but are not 
Hmited to) methods of nucleic acid hybridization and methods of DNA and 

amplification, as exemplified by various uses of nucleic acid 
amplification technologies (e.g. polymerase chain reaction (PCR) Mullis 
et al., U.S. Patent 4,683,202; ligase chain reaction (LCR), Tabor S et al 
Proc Acad. Sci, USA 82:1074, (1985); or strand displacement ' * " 

( a ^92)) Cati ° n (SDA) ' Wa ' ker ' ^ a '"' Pr °°' NatL SCL U S A ' 89: 392 ' 

For example, other c/s-prenyltransferase genes sharing significant 
homology to those of the instant invention, either as cDNAs or genomic 
DMAs, could be isolated directly by using all or a portion of the instant 
nucleic acid fragments as DNA hybridization probes to screen libraries 
from any desired plant using methodology well known to those skilled in 
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the art. Specific oligonucleotide probes based upon the instant nucleic 
acid sequences can be designed and synthesized by methods known in 
the art (Maniatus, supra). Moreover, the entire sequences can be used 
directly to synthesize DNA probes by methods known to the skilled artisan 
5 such as random primers, DNA labeling, nick translation, or end-labeling 
techniques, or RNA probes using available in vitro transcription systems. 
In addition, specific primers can be designed and used to amplify a part of 
(or full-length of) the present sequence. The resulting amplification 
products can be labeled directly during amplification reactions or labeled 

10 after amplification reactions, and used as probes to isolate full length DNA 
fragments under conditions of appropriate stringency. 

Typically, in PCR-type amplification techniques, the primers have 
different sequences and are not complementary to each other. 
Depending on the desired test conditions, the sequences of the primers 

15 should be designed to provide for both efficient and faithful replication of 
the target nucleic acid. Methods of PCR primer design are common and 
well known in the art. (Thein and Wallace, "The use of oligonucleotide as 
specific hybridization probes in the Diagnosis of Genetic Disorders", in 
Human Genetic Diseases: A Practical Approach] K. E. Davis, Ed.; IRL: 

20 Herndon, VA, 1986; pp 33-50); Rychlik, W., In Methods in Molecular 

Biology; PCR Protocols: Current Methods and Applications. White, B. A., 
Ed.; Humania: Totowa, NJ, 1993; Vol. 15, pp 31-39). 

Generally two short segments of the instant sequences may be 
used in polymerase chain reaction protocols to amplify longer nucleic acid 

25 fragments encoding homologous genes from DNA or RNA. The 

polymerase chain reaction may also be performed on a library of cloned 
nucleic acid fragments wherein the sequence of one primer is derived 
from the instant nucleic acid fragments, and the sequence of the other 
primer takes advantage of the presence of the polyadenylic acid tracts to 

30 the 3' end of the mRNA precursor encoding plant UPPS homologs. 

Alternatively, the second primer sequence may be based upon 
sequences derived from the cloning vector. For example, the skilled 
artisan can follow the RACE protocol (Frohman et al., Proc. Natl. Acad. 
Sci. USA 85:8998 (1988)) to generate cDNAs by using PCR to amplify 

35 copies of the region between a single point in the transcript and the 3' or 
5' end. Primers oriented in the 3' and 5' directions can be designed from 
the instant sequences. Using commercially available 3 1 RACE or 5' RACE 
systems (BRL), specifip 3* or 5' cDNA fragments can be isolated (Ohara 
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tt^no^ ^ SC! - USA 86:5673 <1989) '- Loh et al - s °<°>™ 
243.217 (1989)). Products generated by the 3' and 5' RACE procedures 

can be combined to generate full-length cDNAs (Frohman et al 
Techniques 1:165 (1989)). 

5 k u • Alternativel >' to* instant sequences may be employed as 
hybndization reagents for the identification of homologs The basic 
components of a nucleic acid hybridization test include a probe, a sample 
suspected of containing the gene or gene fragment of interest, and a 
specific hybndization method. Probes of the present invention are 

TZ ITT T ded nUC ' eiC a ° id SSqUenCes ,hat are elementary 
to the nucle,c acid sequences to be detected. Probes are "hybridizab J to 
he nucleic acid sequence to be detected. The probe length can vary torn 

tea to be done. Typically a probe length of about 15 bases to about 
15 30 bases is suitable. Only part of the probe molecule need be 

comptementary to the nucleic acid sequence to be detected. In addition 

^^STr ,he probe and ,he ,ar9et * 

perfect. Hybndizaf.cn does occur between imperfectly complementarv 
molecules w«h the result ma, a certain fraction of fhe bases in the " 
hybndized reg,on are not paired with the proper complementary base 
Hybnd,zat,on msthods are well defined. Typically the probe and 

hZiz7 St b T e h mlXed U " der COn<ffl ° nS WhiCh >*™* 
hybndization. This involves contacting the probe and sample in the 

andtT T ' n ° rganiC ° r ° rSaniC S3lt Under the concentration 
and temperature conditions. The probe and sample nucleic acids must be 

e C oro a b : ,0 " 9 en ° U9h that any P ° SSibte Nation beZel 
the probe and sample nucleic acid may occur. The concentration of probe 
or target ,n the mixture will determine the time necessaiy for hybridizafton 
o occur. The higher fhe probe or target concentration the shorterte 
hybndrzafion incubation time needed. OptionaHy a chaotropic agent may 
be added. The chaotropic agent stabilizes nucleic acids by inhibit 
nuclease activity. Furthermore, me chaotropic agent allows sensitive and 
stringent hybndizafion of short oligonucleotide probes a, room temperature 
(Van Ness and Chan, NuC. AciCs Re,, 19:5143-5151 (1991)). Suable 
chaotropic agents include guanidinium chloride, guanidinium thiocyana; 
sodium thiocyanata, lithium tetrachloroacetate. sodium perchlorate 
rubidium tetraohloroacetata, potassium iodide, and cesium 
trifluoroacetate. among others. Typically, the chaotropic agent will be 
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present at a final concentration of about 3M. If desired, one can add 
formamide to the hybridization mixture, typically 30-50% (v/v). 

Various hybridization solutions can be employed. Typically, these 
comprise from about 20 to 60% volume, preferably 30%, of a polar 
5 organic solvent. A common hybridization solution employs about 

30-50% v/v formamide, about 0.15 to 1M sodium chloride, about 0.05 to 
0.1M buffers, such as sodium citrate, Tris-HCI, PIPES or HEPES (pH 
range about 6-9), about 0.05 to 0.2% detergent, such as sodium 
dodecylsulfate, or between 0.5-20 mM EDTA, FICOLL (Pharmacia Inc.) 

10 (about 300-500 kilodaltons), polyvinylpyrrolidone (about 250-500 kdal), 
and serum albumin. Also included in the typical hybridization solution will 
be unlabeled carrier nucleic acids from about 0.1 to 5 mg/mL, fragmented 
nucleic DNA, e.g., calf thymus or salmon sperm DNA, or yeast RNA, and 
optionally from about 0.5 to 2% wt/vol glycine. Other additives may also 

15 be included, such as volume exclusion agents which include a variety of 
polar water-soluble or swellable agents, such as polyethylene glycol, 
anionic polymers such as polyacrylate or polymethylacrylate, and anionic 
saccharidic polymers, such as dextran sulfate. 

Nucleic acid hybridization is adaptable to a variety of assay 

20 formats. One of the most suitable is the sandwich assay format. The 
sandwich assay is particularly adaptable to hybridization under non- 
denaturing conditions. A primary component of a sandwich-type assay is 
a solid support. The solid support has adsorbed to it or covalently coupled 
to it immobilized nucleic acid probe that is unlabeled and complementary 

25 to one portion of the sequence. 

Finally, availability of the instant nucleotide and deduced amino 
acid sequences facilitates immunological screening of DNA expression 
libraries. Synthetic peptides representing portions of the instant amino 
acid sequence may be synthesized. These peptides can be used to 

30 immunize animals to produce polyclonal or monoclonal antibodies with 
specificity for peptides or proteins comprising the amino acid sequences. 
These antibodies can be then be used to screen DNA expression libraries 
to isolate full-length DNA clones of interest (Lerner et al., Adv. Immunol. 
36:1 (1984); Maniatus, supra). 

35 Recombinant Expression - Plants 

It is expected that introduction of chimeric genes encoding the 
instant c/s-prenyltransferase enzymes, under the control of the 
appropriate promoters, will enable increased production of natural rubbers 

23 



10 



15 



30 



35 



WO 2004/044173 

PCT/US2003/036164 

when an appropriate source of IPP is present in the cell to produce 
appropriate initiator molecules (DMAPP, GPP, FPP or GGPP). It is 
contemplated that it will be useful to express the instant genes both in 
natural host cells as well as heterologous plant hosts. 
5 The nucleic acid fragments of the instant invention may also be 

used to create transgenic plants in which any of the instant 
c/s-prenyltransferase proteins are present at higher or lower levels than 
normal, thus permitting modification to the production of natural rubbers 
Introduction of the nucleic acid fragments of the instant invention into 
transgenic plants may have benefit in modifying the rate or timing of 
rubber production, the amount and/or quality of the rubber produced 
and/or the allergenic properties of the resultant rubber. Alternatively' in 
some applications, it might be desirable to express any of the instant 
c/s-prenyltransferases in specific plant tissues and/or cell types or during 
developmental stages in which they would normally not be encountered 
The expression of full-length plant c/s-prenyltransferase cDNAs yields a 
mature protein capable of the synthesis of c/s-polyisoprenoids from IPP 
as the substrate. The presence of an initiator allylic isoprenoid 
diphosphate enhances this activity. 

Further, it is contemplated that transgenic plants expressing any of 
the .nstant c/s-prenyltransferase sequences will have altered or modulated 
defense mechanisms against various pathogens and natural predators 
For example, various latex proteins are known to be antigenic and 
recognized by IgE antibodies, suggesting their role in immunolgical 
defense (Yagami et al., Journal of Allergy and Clinical Immunology, 
101(3): 379-385 (1998)). Additionally it has been shown that a significant 
portion of the latex isolated from Hevea brasiliensis contains 
chitinases/lysozymes, which are capable of degrading the chitin 
component of fungal cell walls and the peptidoglycan component of 
bacterial cell walls (Martin, M. IM., Plant Physiol (Bethesda) 95 (2)- 
469-476 (1991)). It is therefore an object of the instant invention to 
provide transgenic plants having altered, modulated or increased 
defenses towards various pathogens and herbivores. 

Preferred Plant Hosts and Transformation M^thnHc 
Preferred plant hosts will be any variety that will support a high 
production level of the instant c/s-prenyltransferase sequences Suitable 
plant spec.es include those plant species Which produce natural rubber 
(e.g., Hevea brasiliensis, Taraxacum spp.), but are not limited to: tobacco 
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(Nicotiana spp.), tomato (Lycopersicon spp.), potato (Solatium spp.), 
hemp (Cannabis spp.), sunflower (Helianthus spp.), sorghum (Sorghum 
vulgare), wheat (Triticum spp.), maize (Zea /nays), rice (Oryza sativa), rye 
(Secale cereale), oats (yAvena spp.), barley (Hordeum vulgare), rapeseed 
5 (Brassica spp.), broad bean (Vicia faba), french bean (Phaseolus 

vulgaris), other bean species (Vigna spp.), lentil (Lens culinaris), soybean 
(Glycine max), arabidopsis (Arabidopsis thaliana), guayule (Parthenium 
argentatum), cotton (Gossypium hirsutum), petunia (Petunia hybrida), flax 
(Linum usitatissimum), and carrot (Daucus carota sativa). 

10 One skilled in the art recognizes that the expression level and 

regulation of a transgene in a plant can vary significantly from line to line. 
Thus, one has to test several lines to find one with the desired expression 
level and regulation. 

A variety of techniques are available and known to those skilled in 

15 the art for introduction of constructs into a plant cell host. These 

techniques include transformation with DNA employing A. tumefaciens or 
A. rhizogenes as the transforming agent, electroporation, particle 
acceleration, etc. (see, for example, EP 295959 and EP 138341). It is 
particularly preferred to use the binary type vectors of Ti and Ri plasmids 

20 of Agrobacterium spp. Ti-derived vectors transform a wide variety of 
higher plants, including monocotyledonous and dicotyledonous plants, 
such as soybean, cotton, rape, tobacco, and rice (Pacciotti et al., 
Bio/Technology 3:241 (1985); Byrne et al., Plant Cell, Tissue and Organ 
Culture 8:3 (1987); Sukhapinda et al., Plant MoL Biol. 8:209-216 (1987); 

25 Lorz et al., Mol. Gen. Genet. 199:178 (1985); Potrykus, Mol. Gen. Genet 
199:183 (1985); Park et al., J. Plant Biol. 38(4):365-71 (1995); Hiei et al., 
Plant J. 6:271-282 (1994)). The use of T-DNA to transform plant cells has 
received extensive study and is amply described (EP 120516; Hoekema, 
In: The Binary Plant Vector System. Offset-drukkerii Kanters B.V.; 

30 Alblasserdam (1985), Chapter V; Knauf, et al., Genetic Analysis of Host 
Range Expression by Agrobacterium, In: Molecular Genetics of the 
Bacteria-Plant Interaction . Puhler, A. Ed.; Springer-Verlag: New York, 
1983, p 245; and An, et al., EMBO J. 4:277-284 (1985)). For introduction 
into plants, the chimeric genes of the invention can be inserted into binary 

35 vectors as described in the examples. 

Other transformation methods are available to those skilled in the 
art, such as direct uptake of foreign DNA constructs (see EP 295959), 
techniques of electroporation (see Fromm et al., Nature (London) 319:791 
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useful way to quantitate protein expression and to detect replication in 
different plant tissues is to use a reporter gene, such as GUS. Once 
transgenic plants have been obtained, they may be grown to produce 
plant tissues or parts having the desired phenotype. The plant tissue or 
5 • plant parts may be harvested, and/or the seed collected. The seed may 
serve as a source for growing additional plants with tissues or parts having 
the desired characteristics. 

Construction of Chimeric Genes for Transformation 
Overexpression of the instant c/s-prenyltransferases may be 

10 accomplished by first constructing chimeric genes in which the coding 

region is operably-linked to promoters capable of directing expression of a 
gene in the desired tissues at the desired stage of development. For 
reasons of convenience, the chimeric genes may comprise promoter 
sequences and translation leader sequences derived from the same 

15 genes. 3' Non-coding sequences encoding transcription termination 
signals must also be provided. The instant chimeric gene may also 
comprise one or more introns in order to facilitate gene expression. 

Any combination of any promoter and any terminator capable of 
inducing expression of a coding region may be used in the chimeric 

20 genetic sequence. Some suitable examples of promoters and terminators 
include those from nopaline synthase (nos), octopine synthase (ocs) and 
cauliflower mosaic virus (CaMV) genes. One type of efficient plant 
promoter that may be used is a high-level plant promoter. Such 
promoters, in operable linkage with the genetic sequences or the present 

25 invention should be capable of promoting expression of the present gene 
product. High level plant promoters that may be used in this invention, for 
example, include the promoter of the small subunit (ss) of the ribulose-1,5- 
bisphosphate carboxylase from soybean (Berry-Lowe et al., J. Molecular 
andApp. Gen., 1:483-498 (1982)), and the promoter of the chlorophyll a/b 

30 binding protein. These two promoters are known to be light-induced in 
plant cells (see, for example, Genetic Engineering of Plants, an 
Agricultural Perspective , A. Cashmore, Ed.; Plenum: NY, 1983; pp 29-38; 
Coruzzi, G. et a/., The Journal of Biological Chemistry, 258:1 399 (1983); 
and Dunsmuir, P. et a/., Journal of Molecular and Applied Genetics, 2:285 

35 (1983)). 

Plasmid vectors comprising the instant chimeric genes can then be 
constructed. The choice of a plasmid vector depends upon the method 
that will be used to transform host plants. The skilled artisan is well aware 
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of the genetic elements that must be present on the plasmid vector in 
order to successfully transform, select and propagate host cells containing 
the chimeric gene. The skilled artisan will also recognize that different 
independent transformation events will result in different levels and 
patterns of expression (Jones et al., EMBO J. 4:2411-2418 (1985); 
De Almeida et al., Mol. Gen. Genetics 218:78-86 (1989)), and thus that 
multiple events must be screened in order to obtain lines displaying the 
desired expression level and pattern. Such screening may be 
accomplished by Southern analysis of DNA blots (Southern, J. Mol. Biol. 
98: 503 (1975)), Northern analysis of mRNA expression (Kroczek, J 
Chromatogr. Biomed. Appl., 618(1-2): 133-145 (1993)), Western analysis 
of protein expression, or phenotypic analysis. 

For some applications it may be useful to direct the 
c/s-prenyltransferase proteins to different cellular compartments or to 
1 5 facilitate their secretion from the cell. It is thus envisioned that the 

chimeric genes described above may be further modified by the addition 
of appropriate intracellular or extracellular targeting sequences to their 
coding regions (and/or with targeting sequences that are already present 
removed). These additional targeting sequences include chloroplast 
20 transit peptides (Keegstra et al., Cell 56:247-253 (1 989)), signal 

sequences that direct proteins to the endoplasmic reticulum (Chrispeels 
etal., Ann. Rev. Plant Phys. Plant Mol. 42:21-53 (1991)), and nuclear 
localization signal (Raikhel et al., Plant Phys. 100: 1627-1 632 (1992)). 
While the references cited give examples of each of these, the list is not 
25 exhaustive and more targeting signals of utility may be discovered in the 
future which are useful in the invention. 
Recombinant F xpression - Microbial 

The genes and gene products of the instant sequences may also 
be produced in heterologous host cells, particularly in the cells of microbial 
30 hosts. Production of natural rubbers in microbial hosts will be useful when 
an appropriate source of IPP is present in the cell to produce appropriate 
initiator molecules (DMAPP, GPP, FPP or GGPP). Expression in 
recombinant microbial hosts may be useful for the expression of various 
pathway intermediates; or for the modulation of pathways already existing 
in the host for the synthesis of new products heretofore not possible using 
the host. Additionally, recombinant expression may be useful for the 
preparation of antibodies to the c/s-prenyltransferase protein by-methods 
well known to those skilled in the art. The antibodies would be useful for 
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detecting the instant c/s-prenyltransferase proteins in situ in cells or in vitro 
in cell extracts. 

Preferred Microbial Hosts and Transformation Methods 
Preferred heterologous host cells for expression of the instant 
5 genes and nucleic acid fragments are microbial hosts that can be found 
broadly within the fungal or bacterial families and which grow over a wide 
range of temperature, pH values, and solvent tolerances. For example, it 
is contemplated that any bacteria, yeast, and filamentous fungi will be 
suitable hosts for expression of the present nucleic acid fragments. 

10 Because transcription, translation and the protein biosynthetic apparatus 
is the same irrespective of the cellular feedstock, functional genes are 
expressed irrespective of carbon feedstock used to generate cellular 
biomass. Large-scale microbial growth and functional gene expression 
may utilize a wide range of simple or complex carbohydrates, organic 

15 acids and alcohols, or saturated hydrocarbons such as methane or carbon 
dioxide (in the case of photosynthetic or chemoautotrophic hosts). 
However, the functional genes may be regulated, repressed or depressed 
by specific growth conditions, which may include the form and amount of 
nitrogen, phosphorous, sulfur, oxygen, carbon or any trace micronutrient 

20 including small inorganic ions. In addition, the regulation of functional 

genes may be achieved by the presence or absence of specific regulatory 
molecules that are added to the culture and are not typically considered 
nutrient or energy sources. Growth rate may also be an important 
regulatory factor in gene expression. Examples of host strains include but 

25 are not limited to bacterial (e.g., Bacillus, Escherichia, Salmonella and 
Shigella), fungal, or yeast species (e.g., Aspergillus, Saccharomyces, 
Pichia, Candida and Hansenula), 

Methods for the transformation of such hosts and the expression of 
foreign proteins are well known in the art and examples of suitable 

30 protocols may be found In Manual of Methods for General Bacteriology] 
Gerhardt et al., Eds.; American Society for Microbiology: Washington, 
DC, 1994 or In Biotechnology: A Textbook of Industrial Microbiology, 2 nd 
ed., Brock, T. D., Ed.; Sinauer Associates: Sunderland, MA, 1989. 
Construction of Chimeric Genes for Transformation 

35 Microbial expression systems and expression vectors containing 

regulatory sequences that direct high level expression of foreign proteins 
are well known to those skilled in the art. Any of these could be used to 
construct chimeric genes for production of the instant c/s- 

29 



WO 2004/044173 



PCT/US2003/036164 



10 



15 



prenyltransferases. These chimeric genes could then be introduced into 
appropriate microorganisms via transformation to provide high level 
expression of the instant c/s-prenyltransferase proteins. 

Vectors or cassettes useful for the transformation of suitable 
microbial host cells are well known in the art. Typically the vector or 
cassette contains sequences directing transcription and translation of the 
relevant gene, a selectable marker, and sequences allowing autonomous 
replication or chromosomal integration. Suitable vectors comprise a 
region 5' of the gene that harbors transcriptional initiation controls and a 
region 3' of the DNA fragment that controls transcriptional termination. It 
is most preferred when both control regions are derived from genes 
homologous to the transformed host cell, although it is to be understood 
that such control regions need not be derived from the genes native to the 
specific species chosen as a production host. 

Initiation control regions or promoters that are useful to drive 
expression of the instant c/s-prenyltransferases in the desired microbial 
host cell are numerous and familiar to those skilled in the art. Virtually any 
promoter capable of driving these genes is suitable for the instant 
invention including, but not limited to: CYC1, HIS3, GAL1, GAL10 ADH1 
20 PGK, PHOS, GAPDH, ADC1, TRP1, URA3, LEU2, ENO, ' TPl (useful for ' . 
expression in Saccharomyces); AOX1 (useful for expression in Pichia); 
and lac, ara, tet, trp, IP L , IP R , jj, tac, and trc (useful for expression in ' 
Escherichia coli) as well as the amy, apr, npr promoters and various 
phage promoters (useful for expression in Bacillus). 
25 Termination control regions may also be derived from various 

genes native to the preferred hosts. Optionally, a termination site may be 
unnecessary; however, it is most preferred if included. 
Industrial Production j n Microbial Hosts 

Where commercial production of the instant enzymes are desired a 
30 variety of culture methodologies may be applied. For example, large- 
scale production of a specific gene product overexpressed from a 
recombinant microbial host may be produced by both batch or continuous 
culture methodologies. 

A classical batch culturing method is a closed system where the 
35 composition of the media is set at the beginning of the culture and not 
subject to artificial alterations during the culturing process. Thus, at the 
beginning of the culturing process the media is inoculated with the desired 
organism or organisms and growth or metabolic activity is permitted to 
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occur adding nothing to the system. Typically, however, a "batch" culture 
is batch with respect to the addition of carbon source and attempts are 
often made at controlling factors such as pH and oxygen concentration. In 
batch systems the metabolite and biomass compositions of the system 
5 change constantly up to the time the culture is terminated. Within batch 
cultures cells moderate through a static lag phase to a high growth log 
phase and finally to a stationary phase where growth rate is diminished or 
halted. If untreated, cells in the stationary phase will eventually die. Cells 
in log phase are often responsible for the bulk of production of end 
10 product or intermediate in some systems. Stationary or post-exponential 
phase production can be obtained in other systems. 

A variation on the standard batch system is the Fed-Batch system. 
Fed-Batch culture processes are also suitable in the present invention and 
comprise a typical batch system with the exception that the substrate is 
15 added in increments as the culture progresses. Fed-Batch systems are 
useful when catabolite repression is apt to inhibit the metabolism' of the 
cells and where it is desirable to have limited amounts of substrate in the 
media. Measurement of the actual substrate concentration in Fed-Batch 
systems is difficult and is therefore estimated on the basis of the changes 
20 of measurable factors such as pH, dissolved oxygen and the partial 
pressure of waste gases such as CO2. Batch and Fed-Batch culturing 
methods are common and well known in the art and examples may be 
found in Thomas D. Brock in Biotechnology: A Textbook of Industrial 
Microbiology, 2 nd ed., Brock, T. D., Ed.; Sinauer Associates: Sunderland, 
25 MA, 1989; or Deshpande, Mukund V., Appl. Biochem. Biotechnol., 36: 227 
(1992), herein incorporated by reference. 

Commercial production of the instant c/s-prenyltransferases and 
their proteins may also be accomplished with a continuous culture. 
Continuous cultures are open systems where a defined culture media is 
30 added continuously to a bioreactor and an equal amount of conditioned 
media is removed simultaneously for processing. Continuous cultures 
generally maintain the cells at a constant high liquid phase density where 
cells are primarily in log phase growth. Alternatively, continuous culture 
may be practiced with immobilized cells where carbon and nutrients are 
35 continuously added and valuable products, by-products, or waste products 
are continuously removed from the cell mass. Cell immobilization may be 
performed using a wide range of solid supports composed of natural 
and/or synthetic materials. 
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Continuous or semi-continuous culture allows for the modulation of 
one factor or any number of factors that affect cell growth or end product 
concentration. For example, one method will maintain a limiting nutrient 
such as the carbon source or nitrogen level at a fixed rate and allow all 
5 other parameters to moderate. In other systems, a number of factors 
affecting growth can be altered continuously while the cell concentration 
measured by media turbidity, is kept constant. Continuous systems strive 
to ma.ntain steady state growth conditions and thus the cell loss due to 
media being drawn off must be balanced against the cell growth rate in 
1 0 the culture. Methods of modulating nutrients and growth factors for 
continuous culture processes as well as techniques for maximizing the 
rate of product formation are well known in the art of industrial 
microbiology and a variety of methods are detailed by Brock, supra. 

Fermentation media in the present invention must contain suitable 
15 carbon substrates. Suitable substrates may include, but are not limited to- 
monosaccharides (e.g., glucose and fructose), oligosaccharides (e g 
lactose or sucrose), polysaccharides (e.g., starch, cellulose, or mixtures 
thereof), and unpurified mixtures from renewable feedstocks (e g cheese 
whey permeate, cornsteep liquor, sugar beet molasses, and barley malt) 
20 Additionally the carbon substrate may also be one-carbon substrates such 
as carbon dioxide, methane or methanol for which metabolic conversion 
into key biochemical intermediates has been demonstrated. In addition to 
one and two carbon substrates, methylotrophic organisms are also known 
to utilize a number of other carbon containing compounds such as 
25 methylamine, glucosamine and a variety of amino acids for metabolic 

activity. For example, methylotrophic yeast are known to utilize the carbon 
from methylamine to form trehalose or glycerol (Bellion et al., Microb 
Growth C1 Compd., [Int. Symp.], 7 th ed.; Murrell, J. Collin; Kelly Don P 
Eds, Intercept: Andover, UK, 1993; pp 415-32). Similarly, various species 
30 of Cand,da will metabolize alanine or oleic acid (Suiter et al Arch 

Microbiol. 1 53:485-489 (1 990)). Hence it is contemplated that the source 
of carbon utilized in the present invention may encompass a wide variety 
of carbon containing substrates and will only be limited by the choice of 
host organism. 
35 Pathway Engineering 

Knowledge of the sequence of the present genes will be useful in 
manipulating the polyisoprenoid biosynthetic pathways in any organism 
having such a pathway and particularly in other rubber producing plants 
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Methods of manipulating genetic pathways are common and well known in 
the art. Selected genes in a particularly pathway may be up-regulated or 
down-regulated by variety of methods. Additionally, competing pathways 
in an organism may be eliminated or sublimated by gene disruption and 
5 similar techniques. 

Once a key genetic pathway has been identified and sequenced, 
specific genes may be up-regulated to increase the output of the pathway. 
For example, additional copies of the targeted genes may be introduced 
into the host cell on multicopy plasmids such as pBR322. Alternatively the 

10 target genes may be modified so as to be under the control of non-native 
promoters. Where it is desired that a pathway operate at a particular point 
in a cell cycle or during a fermentation run, regulated or inducible 
promoters may used to replace the native promoter of the target gene. 
Similarly, in some cases the native or endogenous promoter may be 

15 modified to increase gene expression. For example, endogenous 

promoters can be altered in vivo by mutation, deletion, and/or substitution 
(see, Kmiec, U.S. Patent 5,565,350; Zarling et a/., PCT/US93/03868). 

Alternatively, it may be necessary to reduce or eliminate the 
expression of certain genes in the target pathway or in competing 

20 pathways that may serve as competing sinks for energy or carbon. 

Methods of down-regulating genes for this purpose have been explored. 

For example, where sequence of the gene to be disrupted is 
known, one* of the most effective methods for gene down-regulation is 
targeted. gene disruption where foreign DNA is inserted into a structural 

25 gene so as to disrupt transcription. This can be effected by the creation of 
genetic cassettes comprising the DNA to be inserted (often a genetic 
marker) flanked by sequences having a high degree of homology to a 
portion of the gene to be disrupted. Introduction of the cassette into the 
host cell results in insertion of the foreign DNA into the structural gene via 

30 the native DNA replication mechanisms of the cell (see for example 
Hamilton et al. J. Bacteriol. 171:4617-4622 (1989); Balbas et al. Gene 
136:211-213 (1993); Gueldener et al. Nucleic Acids Res, 24:2519-2524 
(1996); and Smith et al. Methods MoL Celt. BioL 5:270-277(1996)). 

Alternative methods are available to reduce or eliminate expression 

35 of genes encoding the instant polypeptides, if desirable in plants for some 
applications. In order to accomplish this, a chimeric gene designed for co- 
suppression of the instant polypeptide can be constructed by linking a 
gene or gene fragment encoding that polypeptide to plant promoter 

33 



WO 2004/044173 



PCT/US2003/036164 



10 



15 



20 



25 



30 



35 



sequences. Antisense technology requires that a nucleic acid segment 
from the desired gene is cloned and operably linked to a promoter such 
that the anti-sense strand of RNA will be transcribed. This construct is 
then introduced into the host cell and the antisense strand of RNA is 
produced. Antisense RNA inhibits gene expression by preventing the 
accumulation of mRNA which encodes the protein of interest. The person 
skilled in the art will know that special considerations are associated with 
the use of antisense technologies in order to reduce expression of ' 
particular genes. For example, the proper level of expression of antisense 
genes may require the use of different chimeric genes utilizing different 
regulatory elements known to the skilled artisan. Nonetheless, either the 
co-suppression or antisense chimeric genes could be introduced into 
plants via transformation wherein expression of the corresponding 
endogenous genes is reduced or eliminated. 

Finally, one recent variation upon "classical" antisense and 
cosuppression methodologies is embodied in WO 02/00904, published on 
January 3, 2002. Specifically, it was found that suitable nucleic acid 
sequences and their reverse complement can be used to alter the 
expression of any mRNA encoding a protein of interest which is in 
proximity to the suitable nucleic acid sequence and its reverse 
complement. Surprisingly, the suitable nucleic acid sequence and its 
reverse complement can be either unrelated to any endogenous RNA in 
the host or can be encoded by any nucleic acid sequence in the genome 
of the host provided that the nucleic acid sequence does not encode any 
target mRNA or any sequence that is substantially similar to the target 
mRNA. A preferred artificial and non-naturally occurring, sequence is that 
encoded by the peptide "ELVISLIVES" (SEQ ID NO:35). This approach 
permits a very efficient and robust approach to achieving single, or 
multiple, gene co-suppression using single plasmid transformation. 

Molecular genetic solutions to the generation of plants with altered 
gene expression have a decided advantage over more traditional plant 
breeding approaches. Changes in plant phenotypes can be produced by 
specifically inhibiting expression of one or more genes by antisense 
inhibition or cosuppression or similar methodologies thereto (U S Patent 
No. 5,190,931; U.S. 5,107,065; U.S. 5,283,323; WO 02/00904). An 
antisense or cosuppression construct would act as a dominant negative 
regulator of gene activity. While conventional mutations can yield 
negative regulation of gene activity, these effects are most likely 
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recessive. The dominant negative regulation available with a transgenic 
approach may be advantageous from a breeding perspective. In addition, 
the ability to restrict the expression of specific phenotype to the 
reproductive tissues of the plant by the use of tissue specific promoters 
5 may confer agronomic advantages relative to conventional mutations that 
may have an effect in all tissues in which a mutant gene is ordinarily 
expressed. 

A person skilled in the art will know that special considerations are 
associated with the use of antisense or cosuppression technologies in 

10 order to reduce expression of particular genes. For example, the proper 
level of expression of sense or antisense genes may require the use of 
different chimeric genes utilizing different regulatory elements known to 
the skilled artisan. Once transgenic plants are obtained by one of the 
methods described above, it will be necessary to screen individual 

15 transgenics for those that most effectively display the desired phenotype. 
Accordingly, the skilled artisan will develop methods for screening large 
numbers of transformants. The nature of these screens will generally be 
chosen on practical grounds, and is not an inherent part of the invention. 
For example, one can screen by looking for changes in gene expression 

20 by using antibodies specific for the protein encoded by the gene being 
suppressed, or one could establish assays that specifically measure 
enzyme activity. A preferred method will be one that allows large 
numbers of samples to be processed rapidly, since it will be expected that 
a large number of transformants will be negative for the desired 

25 . phenotype, 

Although targeted gene disruption and antisense technology offer 
effective means of down-regulating genes where the sequence is known, 
other less specific methodologies have been developed that are not 
sequence based. For example, cells may be exposed to UV radiation and 

30 then screened for the desired phenotype. Mutagenesis with chemical 
. agents is also effective for generating mutants and commonly used 
substances include chemicals that affect nonreplicating DNA such as 
HN0 2 and NH 2 OH, as well as agents that affect replicating DNA such as 
acridine dyes, notable for causing frameshift mutations. Specific methods 

35 for creating mutants using radiation or chemical agents are well 
documented in the art. See, for example: Thomas D. Brock in 
Biotechnology: A Textbook of Industrial Microbiology, 2 nd ed., Brock, T. D., 
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Ed.; Sinauer Associates: Sunderland, MA, 1989; or Deshpande, Mukund 
V., Appl. Biochem. Biotechnol., 36: 227 (1992). 

Another non-specific method of gene disruption is the use of 
transposable elements or transposons. Transposons are genetic elements 
5 that insert randomly in DNA but can be later retrieved on the basis of 

sequence to determine where the insertion has occurred. Both in vivo and 
in vitro transposition methods are known. Both methods involve the use of 
a transposable element in combination with a transposase enzyme. When 
the transposable element or transposon is contacted with a nucleic acid 
1 0 fragment in the presence of the transposase, the transposable element will 
randomly insert into the nucleic acid fragment. The technique is useful for 
random mutagenesis and for gene isolation, since the disrupted gene may 
be identified on the basis of the sequence of the transposable element. 
Kits for in vitro transposition are commercially available (see for example 
• 15 The Primer Island Transposition Kit, available from Perkin Elmer Applied 
Biosystems, Branchburg, NJ, based upon the yeast Ty1 element; The 
Genome Priming System, available from New England Biolabs, Beverly, 
MA, based upon the bacterial transposon Tn7; and the EZ::TN Transposon 
Insertion Systems, available from Epicentre Technologies, Madison, Wl, 
20 based upon the Tn5 bacterial transposable element). 
Protein Engineering 

It is contemplated that the instant nucleotides may be used to 
produce gene products having enhanced or altered activity. For example, 
the mutation of frans-prenyltransferases such as farnesyl diphosphate 
25 synthase to a form capable of generating a different and longer product 
(geranylgeranyl diphosphate) than the unmodified enzyme has been 
demonstrated (Ohnuma, S.-l. et al., J. Biol. Chem., 271(17): 10087-10095 
(1996)). Various methods are known for mutating a native gene sequence 
to produce a gene product with altered or enhanced activity including, but 
30 not limited to: 

1 . ) error prone PCR (Melnikov et al., Nucleic Acids Research, 

27(4): 1056-1062 (February 15, 1999)); 

2. ) site directed mutagenesis (Coombs et al., Proteins : Angeletti, 

Ruth Hogue, Ed.; Academic: San Diego, CA, 1998; pp 259-311, 
35 1 plate); and 

3. ) "gene shuffling" (U.S. 5,605,793; U.S. 5,81 1 ,238; 

_ U -S. 5,830,721; and U.S. 5,837,458, incorporated herein by 

reference). 
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The method of gene shuffling is particularly attractive due to its 
facile implementation, and high rate of mutagenesis and ease of 
screening. The process of gene shuffling involves the restriction 
endonuclease cleavage of a gene of interest into fragments of specific 
5 size in the presence of additional populations of DNA regions of both 
similarity to (or difference to) the gene of interest. This pool of fragments 
will then be denatured and reannealed to create a mutated gene. The 
mutated gene is then screened for altered activity. 

The instant plant sequences of the present invention may be 

10 mutated and screened for altered or enhanced activity by this method. 
The sequences should be double stranded and can be of various lengths 
ranging from 50 bp to 10 kb. The sequences may be randomly digested 
into fragments ranging from about 10 bp to 1000 bp, using restriction 
endonucleases well known in the art (Maniatis, supra). In addition to the 

15 instant plant sequences, populations of fragments that are hybridizable to 
all or portions of the microbial sequence may be added. Similarly, a 
population of fragments that are not hybridizable to the instant sequence 
may also be added. Typically these additional fragment populations are 
added in about a 10 to 20 fold excess by weight as compared to the total 

20 nucleic acid. Generally, if this process is followed, the number of different 
specific nucleic acid fragments in the mixture will be about 100 to about 
1000. The mixed population of random nucleic acid fragments are 
denatured to form single-stranded nucleic acid fragments and then 
reannealed. Only those single-stranded nucleic acid fragments having 

25 regions of homology with other single-stranded nucleic acid fragments will 
reanneal. The random nucleic acid fragments may be denatured by 
heating. One skilled in the art could determine the conditions necessary 
to completely denature the double stranded nucleic acid. Preferably the 
temperature is from about 80°C to 100°C. The nucleic acid fragments 

30 may be reannealed by cooling. Preferably the temperature is from about 
20°C to 75°C. Renaturation can be accelerated by the addition of 
polyethylene glycol ("PEG") or salt. A suitable salt concentration may 
range from 0 mM to 200 mM. The annealed nucleic acid fragments are 
then incubated in the presence of a nucleic acid polymerase and dNTPs 

35 (i.e., dATP, dCTP, dGTP and dTTP). The nucleic acid polymerase may 
be the Klenow fragment, the Taq polymerase or any other DNA. 
polymerase known in the art. The polymerase may be added to the 
random nucleic acid fragments prior to annealing, simultaneously with 
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annealing or after annealing. The cycle of denaturation, renaturation and 
incubation in the presence of polymerase is repeated for a desired 
number of times. Preferably the cycle is repeated from 2 to 50 times, 
more preferably the sequence is repeated from 10 to 40 times. The ' 
resulting nucleic acid is a larger double-stranded polynucleotide ranging 
from about 50 bp to about 100 Kb and may be screened for expression 
and altered activity by standard cloning and expression protocols 
(Manatis, supra). 

Furthermore, a hybrid protein can be assembled by fusion of 
functional domains using the gene shuffling (exon shuffling) method 
(N,xon et al., PNAS, 94:1069-1073 (1997)). The functional domain of the 
instant gene can be combined with the functional domain of other genes 
to create novel enzymes with desired catalytic function. A hybrid enzyme 
may be constructed using PCR overlap extension methods and cloned 
into various expression vectors using the techniques well known to those 
skilled in art. 
Other Applications 

The instant c/'s-prenyltransferase proteins can be used as a target 
to facilitate the design and/or identification of inhibitors of c/s-prenyl- 
transferases that may be useful as herbicides or fungicides. This could 
be achieved either through the rational design and synthesis of potent 
functional inhibitors that result from structural and/or mechanistic 
information that is derived from the purified instant plant proteins, or 
through random in vitro screening of chemical libraries. It is anticipated 
that significant in vivo inhibition of the c/s-prenyltransferase proteins 
described herein may severely cripple cellular metabolism and likely result 
in plant (or fungal) death. 

All or a portion of the nucleic acid fragments of the instant invention 
may also be used as probes for genetically and physically mapping the 
genes that they are a part of, and as markers for traits linked to 
expression of the instant c/s-prenyltransferases. Such information may be 
useful in plant breeding in order to develop lines with desired phenotypes. 
For example, the instant nucleic acid fragments may be used as 
restriction fragment length polymorphism (RFLP) markers. Southern blots 
(Maniatus, supra) of restriction-digested plant genomic DNA may be 
probed with the nucleic acid fragments of the instant invention. The 
resulting banding patterns may then be subjected to genetic analyses 
using computer programs such as MapMaker (Lander et al., Genomics 
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1:174-181 (1987)) in order to construct a genetic map. In addition, the 
nucleic acid fragments of the instant invention may be used to probe 
Southern blots containing restriction endonuclease-treated genomic DNAs 
of a set of individuals representing parent and progeny of a defined 
5 genetic cross. Segregation of the DNA polymorphisms is noted and used 
to calculate the position of the instant nucleic acid sequences in the 
genetic map previously obtained using this population (Botstein et al., Am. 
J. Hum. Genet. 32:314-331 (1980)). 

The production and use of plant gene-derived probes for use in 
10 genetic mapping is described by Bernatzky et al. (Plant Mol. Biol. 
Reporter 4:37-41 (1986)). Numerous publications describe genetic 
mapping of specific cDNA clones using the methodology outlined above 
or variations thereof. For example, F2 intercross populations, backcross 
populations, randomly mated populations, near isogenic lines, and other 
15 sets of individuals may be used for mapping. Such methodologies are 
well known to those skilled in the art. 

Nucleic acid probes derived from the instant nucleic acid 
sequences may also be used for physical mapping (i.e., placement of 
sequences on physical maps; see Hoheisel et al., Nonmammalian 
20 Genomic Analysis: A Practical Guide; Academic, 1 996; pp. 31 9-346 and 
references cited therein). 

In another embodiment, nucleic acid probes derived from the 
instant nucleic acid sequences may be used in direct fluorescence in situ 
hybridization (FISH) mapping. Although current methods of FISH 
25 mapping favor use of large clones, improvements in sensitivity may allow 
performance of FISH mapping using shorter probes. 

A variety of nucleic acid amplification-based methods of genetic 
and physical mapping may be carried out using the instant nucleic acid 
sequences. Examples include allele-specific amplification (Kazazian 
30 et al., J. Lab. Clin. Med. 1 14:95-96 (1989)), polymorphism of PCR- 
amplified fragments (CAPS; Sheffield et al., Genomics 16:325-332 
(1993)), allele-specific ligation (Landegren et al., Science 241:1077-1080 
(1988)), nucleotide extension reactions (Sokolov et a!., Nucleic Acid Res. 
18:3671 (1990)), Radiation Hybrid Mapping (Walter et al., Nature 
35 Genetics 7:22-28 (1 997)), and Happy Mapping (Dear et al., Nucleic Acid 
Res, 17:6795-6807 (1989)). For these methods, the sequence of a 
nucleic acid fragment is used to design and produce primer pairs for use 
in the amplification reaction or in primer extension reactions. The design 
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of such pr im er s is well known to those skilled in the art , n methods using 
PCR-based genetic mapping, it may be necessary to identify DNA 

^ZTco^'T"^" ParentS ° f ,hS Cross <" *• 

5 Znerl P ! 9 ° ^ inStan ' nUClei ° aCW S ^ Usnce - ™s, however 
5 is generally not necessary for mapping methods 

instant L c 0 DNA° f ! UnCti0 r U,an,PhSn0,yPeS maybe ide "«ned for the 
nstan, oDNA clone erther by targeted gene disruption protocols or by 

Plan fc" 9 SP mUtantS f ° r "* 9ene C ° nta,TOd in a of 
10 nT* 7? . mUta " 0nS " a " P ° SSible 9enes <•*• B *"*>9eret al Proc 

USA T 402 (19S9,: KOSS 6 * a '" ^ ™ 

USA 92.8149 (1995); Bensen et al., Plant Cell 7:75 (19951) The latter 

approach may be accomplished in Wo ways. Firs., short egmen of he 

instant nuclerc acid fragments may be used in polymerase chain react"" 

. 5 pra ared ITt* 3 mU,a,i0n «" 

som th P ° P " ° f P ' an,S ln Which Mutator tansposons or 

some other mutation-causing DNA element has been introduced (see 
Bensen et al., supra). The amplification of a specific DNA fragment with 

eTr thTIn, "*"" *" inSert '° n """*» ^ " ™ 
near the plant gene encoding the cfe-prenyltransferase protein 

0 Aternabvely. the instan, nucleic acid fragments may be used as a 

the m^T T a93inSt PCR am P ,fflca *" Pf°°"oto generated from 
the mutation population using the mutation tag sequence primer in 
conjuncfion wfth an arbitrary genomic site primer, such as thatfora 
restoction enzyme site-anchored synthetic adaptor. With either method a 
> Plant contamrng a mutation in the endogenous gene encoding a 

crs-prenyltransferase protein can be identified and obtained. This mutant 
P ant can then be used to determine or confirm the natural funcnon ofThe 
crs-prenyKransferase gene product. 

DESCRIPTION OF PREFFRRFn PM Pf -,r,„,, rNT „ 
Numerous studies have examined prenyltransferases capable 
producing long-chain isoprenoids with frans-chain configuration However 
idenwcabon o, those prenyltransferases that condense isoprene unTin 
the configuration are less well studied. Undecapreny, pyraphosphate 

«1 1 483^ ST f r; S °' a,eC " r0m E ° 0 "' in 19 " b ' Apfel «» *Sa"' 
181(2). 483-492). Apfel et al. also published an alignment of the deduced 
amino aoid of ^ £ ^ Upp synthgse ^ 

of other publrcly-available sequences from bacteria, yeast > 
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(Saccharomyces cerevisiae) and one eukaryote (Caenorhabditis elegans), 
which revealed five conserved domains. These domains are shown 
below: 

Domain I: HxxxxMDGN(RG)R(WYF)A (SEQ ID NO:29); 

5 Domain II: GHxxG (SEQ ID NO:30); 

Domain III: (TS)xxAFS(ST)ENxxRxxxEVxxLMxL 

(SEQ ID NO:31); 

Domain IV: AxxYGGRx(DE)(LIVM)xxA (SEQ ID NO:32); 

Domain V: (DE)LxlRT(SAG)GExRxSNF(ML)(LMP)W 

10 QxxY(SAT)ExxFxxxxWP(DE)F (SEQ ID NO:33). 

Apfel et al. predicts that these conserved domains, as well as a few 
single conserved amino acids outside of the conserved domains, likely 
represent the active site of the protein. 

In the present invention, the Applicants describe unique plant 

15 homologs of microbial c/s-prenyltransferase proteins that are involved in 
the synthesis of poly-c/s-isoprenoids. More specifically, these c/s- 
prenyltransferases have been isolated from the natural rubber producing 
plants russian dandelion (Taraxacum kok-saghyz) and sunflower 
(Helianthus annus). Comparison of these cDNA sequences to the 

20 GenBank database using the BLAST algorithm, well known to those 

skilled in the art, reveals that these c/s-prenyltransferase proteins belong 
to the broad family of known c/s-prenyltransferase genes. This conclusion 
is additionally based on the presence of conserved domains I-V, as 
described by Apfel et al., supra. 

25 Further analysis of c/s-prenyltransferase sequences, however, 

reveals surprisingly unique characteristics that are specific for those c/s- 
prenyltransferases isolated from rubber-producing plants. More 
specifically, the Applicants describe: 

1. Modified sequences of conserved domains I, IV, and V, with 

30 respect to Apfel et al., that are indicative of the subfamily of c/s- 

prenyltransferases associated with rubber-producing plants; 
and 

2. A unique non-conserved domain between conserved domain IV 
and V, that is present in c/s-prenyltransferases from rubber- 

35 producing plants and that is absent in c/s-prenyltransferases 

from other plants. 
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These two identifying characteristics are thus diagnostic for cis- 
prenyltransferases from rubber-producing plants and will permit rapid 
identification of c/s-prenyltransferases from rubber-producing species. 

EXAMPLES 

The present invention is further defined in the following Examples 
It should be understood that these Examples, while indicating preferred 
embod.ments of the invention, are given by way of illustration only From 
the above discussion and these Examples, one skilled in the art can 
ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes 
and modifications of the invention to adapt it to various usages and 
conditions. 

GENERAL METHODS 
Standard recombinant DNA and molecular cloning techniques used 
here are well known in the art and are described by Sambrook et al 
Molecular Cloning: A Laboratory Manual 2 nd ed., Cold Spring Harbor 
Laboratory: Cold Spring Harbor, NY, 1989 (hereinafter "Maniatus"); and by 
T. J. Silhavy, M. L. Bennan, and L. W. Enquist, Experiments with n ona 
fusions, Cold Spring Harbor Laboratory: Cold Spring Harbor NY 1984- 
and by Ausubel et al., Current Protons in Molecular Riom^ published 
by Greene Publishing Assoc. and Wiley-lnterscience, 1 987. 
Nucleotide and amino acid percent identity and similarity comparisons 
were made using the BLAST (Basic Local Alignment Search Tool; Altschul 
et al., J. Mol. Biol. 215:403-410 (1993); see also 

www.ncbi.nlm.nih.gov/BLAST/) algorithms and also the Vector NTI suite of 
programs, applying default parameters unless indicated otherwise 
The meaning of abbreviations is as follows: "sec" means second(s) "min" 
means minute(s), "h" means hour(s), "d" means day(s), "uL" means ' 
m,croliter, »mL" means milliliters, "L" means liters, "uM" means micromolar 
'mM" means millimolar, "M" means molar, "mmol" means millimole(s) 
"umole" mean micromole", "g» means gram, "ug" means microgram "ng" 
means nanogram, "U" means units, "bp" means base pairs, and "kB" 
means kilobase. 

EXAMPLE 1 

Preparation of cDNA Li brar i es from Russian Dand^ on an d Snnfm^r 

This example describes the preparation of two cDNA libraries one 
from russian dandelion latex tissue and one from sunflower leaf tissue. 
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These libraries were then used for sequencing of expressed sequencing 
tags (ESTs). 

Library Construction for Russian Dandelion. Taraxacum kok-saahvz 
A cDNA library representing mRNAs from russian dandelion latex 
5 tissue was prepared, using the SMART cDNA Library Construction Kit 
(Clontech, Palo Alto, CA). The cDNAs were introduced into plasmid 
vectors by first preparing the cDNA library in XTriplEx2 vectors and then 
converted into a plasmid library (Clontech). Upon conversion, cDNA 
inserts were contained in the plasmid vector pTrip I Ex2 and plasmid DNAs 
10 were prepared from randomly selected bacterial colonies. Amplified insert 
DNAs or plasmid DNAs were sequenced in dye-primer sequencing 
reactions to generate partial cDNA sequences (expressed sequence tags 
or "ESTs"; see Adams et al., Science 252:1651-1656 (1991)). The 
resulting ESTs were analyzed using a Perkin Elmer Model 377 fluorescent 
15 sequencer. 

Library Construction for Sunflower. Helianthus annus 
SMF3 Sunflower plants were grown in the greenhouse for 4 weeks 
and then transferred to a growth chamber with a 12 hr photoperiod, at 
22°C and 80% relative humidity. The sunflower pathogen, Schlerotinia 
20 sclerotiorum (isolate 255M), was maintained on a PDA plate at 20°C in the 
dark. When the sunflower plants were 6 weeks old, they were inoculated 
with Sc/erof/n/a-infested carrot plugs with active growing mycelia. For 
each plant, three petioles were inoculated and wrapped with parafilm. 
Leaf tissue samples were collected, immediately frozen in liquid nitrogen, 
25 and stored at -80°C. 

Total RNA was isolated from this tissue using TriPure Reagent 
(Roche Applied Science, Indianapolis, IN). Subsequently, mRNAs were 
isolated using a mRNA purification kit (Invitrogen, Carlsbad, CA). A cDNA 
library representing mRNAs from sunflower leaf tissue infected with the 

30 pathogen S. sclerotiorum was prepared, using the Lamda ZAPII-cDNA 
synthesis kit (Stratagene, LaJolla, CA). Once the cDNA inserts were in 
plasmid vectors, plasmid DNAs were prepared from randomly selected 
bacterial colonies containing recombinant pBluescript plasmids. Amplified 
insert DNAs or plasmid DNAs were sequenced in dye-primer sequencing 

35 reactions to generate partial cDNA sequences (expressed sequence tags 
or "ESTs"; see Adams et al., supra. The resulting ESTs were analyzed 
using a Perkin Elmer Model 377 fluorescent sequencer. 
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EXAMPLE 2 

Identification and Char a cterization of c^pjenyltraiTsferases 
This Example describes the methodology utilized to conduct 
BLAST analyses on each EST sequenced in Example 1 and the " 
identification of two novel c/s-prenyltransferase genes. 

Specifically, all sequences from Example 1 were identified by 
conducting BLAST (Basic Local Alignment Search Tool; Altschul, S. F 
et al., (1993) J. Mol. Biol. 215:403-410; see also 
www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant 
GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the SWISS-PROT protein 
sequence database, EMBL, and DDBJ databases). 

The cDNA sequences were analyzed for similarity to all publicly 
available DNA sequences contained in the "nr" database using the 
BLASTN algorithm provided by the National Center for Biotechnology 
Information (NCBI). The DNA sequences were translated in all reading 
frames and compared for similarity to all publicly available protein 
sequences contained in the "nr" database using the BLASTX algorithm 
(Gish, W. and States, D. J. Nature Genetics 3:266-272 (1993)) provided bv 
the NCBI. 

cDNAs were further identified by searches of the database using 
the TBLASTN algorithm provided by the National Center for Biotechnology 
Informatioh (NCBI) and short fragments of conserved sequence present in 
25 known c/s-prenyltransferases (conserved domains l-V, as described by 
■Apfel et al., J. Bacteriol. 81:483-492 (1999)). These sections of 
conserved sequence were expected to be diagnostic for the 
c/s-prenyltransferase family of enzymes. 

The results of these BLAST comparisons are given below in 
30 Table 2 for the ESTs of the present invention. Table 2 summarizes the 
sequence to which each EST potentially encoding a c/s-prenyltransferase 
has the most similarity (presented as % similarities, % identities, and 
expectation values). The table displays data based on the BLASTXnr 
algorithm with values reported in expect values. The Expect value 
estimates the statistical significance of the match, specifying the number 
of matches, with a given score, that are expected in a search of a 
database of this size absolutely by chance. 
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The russian dandelion EST was found to have the highest 
homology (44% identity) to a partial clone of a c/s-prenyltransferase gene 
jrfrtArwfflfensfe (Accession Number AB061235), using automated 

, 7t m o? arCh8S 39ainSt Sec « L,ences deposited in the public databases 
5 (Table 2). To further analyze the dandelion EST sequence it was 

translated and aligned with other fidfclength c/s-prenyltransferase genes 
Using this approach the sequence exhibited 30.5% identity with its closest 
homolog, the Hevea Hpt1 gene product (SEQ ID NO:8) 

The sunflower EST sequence was found to have the highest 
10 homology (57% identity) to a full-length clone of a c/s-prenyltransferase 
gene of K brasiliensis (Accession Number AB061237), using automated 

n- hf ^ 88 a9ainSt S6quences de P° sited 'n the public databases 
(I able 2). Comparison of the sunflower EST sequence (SEQ ID NC-6) to 

15 T 1 9606 Pr ° dUCt (SE ° ,D N ° :8) determi ned that there was 

15 24.4% identity to Hpt1. 

In addition to the homology both ESTs exhibited with other known 
c/s-prenyltransferase genes, the russian dandelion and sunflower EST 
also was found to possess significant homology to one of the five 
conserved domains reported by Apfel et al. (supra). Specifically, both 
20 ESTs possesed the amino acid sequence- 

DILVRSSGETRLSNFLLWQTTNCVLYSPKALWPEM (SEQ ID NO- 34) 
wh.ch shares homology with Domain V of Apfel et al (supra) 

h / Urther ana ' ySiS ° f ° NA a,l 9 nments 1 however, revealed that 

25 f^n ITJr dande,i ° n and SUnfl ° Wer EST Se ^ ences did not encode 
25 full length ORFs. The 5' end of the russian dandelion cDNA appeared to 

oleTZZZr™ 5P ' Whi ' e ^ SUnfl ° Wer CDNA appeared to be ^ 
oven 92 bp of ,ts 5' sequence. The full-length c/s-prenyltransferase 

oDNA sequences, therefore, could not be determined, and the low % 
homo ogies in alignments with known c/s-prenyltransferases are due to 
30 use of partial cDNAs. 

EXAMPLE 3 
Acquisition of Full-lenq jnjjuj*^^ 

. *u r!!, describes the methodology used to isolate the full- 

length cDNA for the russian dandelion c/s-prenyltranferase, since the 
dandelion sequence analyzed in Example 2 appeared to be missing the 
5 end when aligned with known full-length c/s-prenyltransferases 
th* «, R H P ' d amp,if, ' Cati0n ° f CDNA en ds (RACE) was performed to obtain 
the 5 end sequence of the russian dandelion c/s-prenyltransferase gene 
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using the FirstChoice RLM-RACE Kit (Ambion, Austin, TX). The gene- 
specific oligonucleotides used for the outer 5'RLM-RACE PCR was 
NKH46 (SEQ ID NO:36) and for the inner 5'RLM-RACE PCR was NKH45 
(SEQ ID NO:37). Several PCR products were obtained by RACE. These 
5 products were then cloned Using a TOPO TA-cloning kit (Invitrogen, 
Carlsbad, CA) and transformed into E. coli. Plasmids were isolated and 
purified using QIAFilter cartridges (Qiagen, Valencia, CA). 

Sequences were generated on an ABI Automatic sequencer using 
dye terminator technology, using a combination of vector-specific primers, 

10 and editing was performed in Vector NTI (InforMax Inc., North Bethesda, 
MD). To aid in the analysis of RACE PCR products, the design of the 
primers used in RACE was such that the amplified 5' end RACE products 
contain at least 200 bp from the 5* end of the known partial cDNA 
sequence. Thus, the sequence of the PCR products obtained by RACE 

15 were aligned with the cDNA sequence of the russian dandelion c/s- 
prenyltransferase EST in Vector NTI's Contig Express. Those PCR 
products that did not align with at least 200 bp of the partial cDNA 
sequence of the russian dandelion c/s-prenyltransferase EST were 
discarded. One clone (#3-4) obtained by 5* RACE contained 258 bp of 

20 sequence (SEQ ID NO:2) identical to that of the EST representing the 
partial russian dandelion c/s-prenyltransferase cDNA, verifying that this 
RACE product was genuine. This allowed the sequence of the full-length 
russian dandelion cDNA clone (SEQ ID NO:3) to be assembled in Vector 
NTI's ContigExpress program. The deduced full-length amino acid 

25 sequence (SEQ ID NO:4) exhibited 49.8% identity (61.2% similarity) with 
that of the Hevea Hpt1 gene product (SEQ ID NO:8). 



domain in the c/s-prenyltransferases of rubber-producing plants, 
discovered from alignments of three Hevea c/s-prenyltransferases (SEQ 
ID NOs:8-10), the russian dandelion c/s-prenyltransferase (SEQ ID NO:4), 
and the sunflower c/s-prenyltransferase (SEQ ID NO:6). This domain will 
35 be a useful tool to rapidly identify c/s-prenyltransferases likely to be 
involved in long-chain rubber biosynthesis in the future. Additionally, 
modified conserved domains were identified for c/s-prenyltransferases 



EXAMPLE 4 
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Identification of a Diagnostic Non-Conserved Domain in Rubber- 
Producing c/s-Prenyltransferases 
This Example describes the identification of a non-conserved 
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from rubber-producing plant species, corresponding to the conserved 
domains of Apfel et al. (J. Bacterial. 81:483-492 (1999)) 

An aiignment of the deduced amino acid sequences of the cDNAs 

< ALo e n' nStant inVePti ° n ^ Vari0US kn ° Wn ^-Prenyltransferases 

5 (WO 01/21650) was created usina the* ni icitai \a/ ~ 

x/trr-r^D k i~t~! •* 9 OLU STALW program within the 

VECTOR NTI suite of programs (full alignment not shown). Specifically 
aligned sequences include those from: 1.) rubber-producing plants (i e 
russian dandelion, sunflower and Hevea, corresponding to SEQ ID 
NOs:4, 6 and 8-10); 2.) non-rubber-producing plants (i.e., rice, marigold 

SEO e . D S Mo e T,' T* da ' Sy ' ******* corresponding to 

SEQ © N o S :1 2l 7, 11, 14, 15, 16, and 23-26); and 3.) microbes (i e 

M^ooccus and SaccHeromyces, corresponding to SEQ ID NOs:18 and 
20 and 22). The alignment confirmed the presence of the conserved 
domains characteristic of this gene family (Apfel et al., supra) 

A portion of the alignment is shown in Figure 1 , corresponding to 
the reg,on between Domain IV and V. This region defines a non- 
conserved domain indicative of the subfamily of c/s-prenyltransferases 
associated with rubber-producing plants. Specifically, the domain 
comprises a sequence of non-conserved amino acids present between 
Domains IV and V, wherein the presence of the domain results in more 
than 50 amino acid residues being present between the absolutely 
conserved tyrosine of Domain IV and the first of the absolutely conserved 
argmme residues of Domain V. This is the first sequence feature to 
emerge as diagnostic for c/s-prenyltransferases from rubber-producing 
Plants, as there had not been enough proteins from such species 
characterized prior to this discovery to be able to identify such 
distinguishing feature(s). 

interestingly, SEQ ID NO:24, an Arabidopsis c/s-prenyltransferase 
genom,c clone of unknown function, alone of the non-rubber-producing 
spec.es, contains a similar insert to the identified_non-conserved domain 
of the present invention. This gene in Arabidopsis may thus represent a 
homolog of c/s-prenyltransferases involved in rubber production present in 
the genome of this species. 

Additionally, a c/s-prenyltransferase protein from a rubber- 
producing plant can be identified by the presence of the conserved 
domains of amino acid sequences as follows- 

Domain . AR(L/M)DGNRRFA (SEQ ID NO:38) 

DomamlV Y(T/S)SXX(D/E)IXXA (SEQ ID NO-30 
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Domain V PXPD(IAOL(IAOR(SyT)SG(E/L)(S/T)RLSNXLLWQ 

(SEQ ID NO:40) 

where these three domains occur sequentially in the order I, IV, V within 
the amino acid sequence and X may be any amino acid. These domains 
5 are essentially those recognized previously in bacterial sequences (Apfel, 
et al. supra), but have been modified to account for the differences 
observed in alignments of sequences of c/s-prenyltransferases derived 
from plants (WO 01/21650). 

EXAMPLE 5 

10 Expression analysis of the russian dandelion c/s-prenvltransferase 

This example describes work performed to examine the expression 
of the russian dandelion c/s-prenyltransferase in leaf, root, scape and 
latex tissues. As expected, the protein is expressed predominantly in 
tissues known to accumulate rubber in this species (i.e., in the rubber- 

15 containing latex). 

RNA was prepared from the leaf, root and scape of russian 
dandelion, using the RNAeasy Midi-Kit (Qiagen, Valencia, CA) for 
samples from plant tissue. RNA from russian dandelion latex was 
prepared as decribed by Kush, et.al. (Proc. Natl. Acad. ScL 87:1787-1790 

20 (1990)). 10 pg of total RNA from russian dandelion latex, leaf, root, and 
scape was denatured on a formadelhyde gel, using products and the 
supplied protocol from 5' to 3', Inc. (Boulder, CO). The gel was rinsed 
twice in 20x SSC for 15 min and then transferred to a nylon membrane 
(Roche Applied Science, Indianapolis, IN) by capillary action at 4°C 

25 overnight. The RNA was then crosslinked to the membrane using a UV 
crosslinker (Stratagene, La Jolla, CA). 

A digoxigenin (DIG) labeled russian dandelion c/s-prenyltransferase 
EST fragment was synthesized, using the PCR DIG Probe Synthesis Kit 
(Roche Applied Science, Indianapolis, IN) and the following 

30 oligonucleotides: Dan5 (SEQ ID NO:41) and Dan6 (SEQ ID NO:42). This 
probe was then hybridized to the membrane and detected using the DIG 
Wash and Block Buffer Set (Roche Applied Science, Indianapolis, IN). 
The membrane was then exposed to BioMax Scientific Imaging Film 
(Eastman Kodak Co., Rochester, NY) for 20 min. As shown in Figure 2A, 

35 c/s-prenyltransferase expression was detected in the root (lane A), scape 
(lane B) and latex (lane C) tissues, with the highest level of expression 
detected in latex. Little or no expression of c/s-prenyltransferase was 
detected in the leaf tissue (lane D). 
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The membrane was then stripped of the DIG labeled russian 
dandelion c/s-prenyltransferase probe by washing it in boiling 0.1% SDS 
for 10 min, followed by 1x Washing Buffer from the DIG Wash and Block 
Buffer Set for 5 min. A digoxigenin (DIG) labeled russian dandelion 
ubiquitin probe was synthesized, using the DIG DNA labeling Kit, 
according to the supplied protocol (Roche Applied Science). This probe 
was then hybridized to the membrane, detected using the DIG Wash and 
Block Buffer Set, and the membrane was exposed to BioMax Scientific 
Imaging Film (20 min). 

Ubiquitin expression was detected in all tissues (Figure 2B). 
Assuming that ubiquitin is equally expressed in all russian dandelion 
tissues, the amount of leaf (lane D), latex (lane C) and root (lane A) RNA 
loaded onto the gel was approximately equal while slightly more scape 
(lane B) RNA was loaded. It is clear from this analysis that the dandelion 
c/s-prenyltransferase gene is expressed predominantly in tissues known to 
accumulate rubber in this species, and in particular in the rubber- 
containing latex. Thus, there is a clear association between this gene 
product and rubber biosynthesis. 

EXAMPLE 6 

Cloning of a partial cDN A seq uence of ths russian dandelion ni*. 
prenyltransferase gene using syntheti c oligonucleotide primers in mvpr^. 

transcriptase PCR 
This Example serves to confirm the presence of a transcript of the 
cloned c/s-prenyltransferase gene in latex of russian dandelion, as 
indicated in the proceeding examples. It also demonstrates how synthetic 
oligonucleotide primers designed using gene sequences of plant cis- 
prenyltransferases may be used to clone additional c/s-prenyltransferase 
genes from other species. 

SEQ ID NOs:8-10, respresenting the Hevea Hpt1, Hpt2 and Hpt3 
proteins were aligned using Vector NTI. A degenerate sense primer was 
designed to a region of high conservation (SEQ ID NO:43). Then, the 
following amino acid sequences were aligned in Vector NTI: SEQ ID 
NOs:7-10 and 12-16, representing the c/s-prenyltransferase proteins from 
Hevea, pot marigold, rice, soybean, wheat, and the african daisy. A 
degenerate antisense primer was designed to a region of high 
conservation (SEQ ID NO:44). 

RT-PCR was performed on total russian dandelion latex RNA with 
these primers (SEQ ID NOs:43 and 44), using Platinum PCR SuperMix 
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(Invitrogen, Carlsbad, CA). The resulting RT-PCR products were TA- 
cloned, using the pGEM-T Easy Vector System (Prornega Corp., Madison, 
Wl) and the resulting plasmids were transformed into E. coli. Plasmids 
were isolated and purified using QIAFilter cartridges (Qiagen, Valencia, 
5 CA). Sequences were generated on an ABI Automatic sequencer using 
dye terminator technology, using a combination of vector-specific primers, 
and sequence editing was performed in Vector NTI. 

The nucleotide sequences of the RT-PCR products were aligned 
with nucleotide sequences of known plant c/s-prenyltransferase genes 

10 (Table 1). One 799 bp RT-PCR product (clone #4-4) showed significant 
homology to the known p/s-prenyltransferase genes. The deduced amino 
acid sequence of this RT-PCR product (SEQ ID NO:45) was aligned with 
the deduced amino acid sequences of the known plant c/s- 
prenyltransferase proteins as well as the amino acid sequence of the 

15 undecaprenyl diphosphate synthase (UPPS) protein and was determined 
by homology to be a russian dandelion homolog of UPPS. 



Comparison of rubbers prepared from different rubber-producing plant 

species 



from russian dandelion, Hevea, sunflower and guayule. 

The roots of 5 russian dandelion plants were cut off at the point 
where leaves emerged, and latex which seeped out of the cut roots was 
collected, yielding 200 mg latex. After stirring overnight in toluene (10 ml), 

25 the preparation was extracted with water in a separating funnel and the 
rubber precipitated from the organic phase by addition of an equal volume 
of methanol. After redissolving in toluene, methanol precipitation was 
repeated a further two times to purify the rubber. A total of 49.3 mg rubber 
was thus obtained, which was dissolved in toluene for analysis. 

30 Hevea and guayule (P. argentatum) washed rubber particles were 

prepared essentially according to previously published procedures 
(Cornish, K., et al. J. Natural Rubber Res. 8:275-285 (1993); Cornish, K., 
and Backhaus, R. Phytochemistry 29: 3808-3813 (1990)). Rubber was 
extracted into toluene and, after washing with water, precipitated three 

35 times with methanol as described above. From 274 mg guayule rubber 
particles, 45.6 mg rubber was obtained; and from 303.8 mg Hevea rubber 
particles, 50.8 mg rubber was obtained. 
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This Example compares the properties of natural rubber prepared 
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Sunflower rubber was prepared by extraction of freeze-dried lea'f 
matenal m a Soxhlet apparatus first with acetone and then with hexane 
To the hexane extract, an equal volume of methanol was added to 
precipitate the rubber. The precipitate was collected by filtration onto 
glass fiber filters, and after allowing solvent to evaporate, redissolved in 
toluene. Methanol precipitation from toluene was repeated three times 
From 27.7 g leaf dry weight, 5.1 mg rubber was obtained. 

To determine molecular weight, samples of rubber (dissolved in 
toluene) were subjected to gel permeation chromatography on PLGel 
columns (Polymer Laboratories, Amherst, MA) calibrated with polystyrene 
standards (Polymer Laboratories). Tetrahydrofuran (THF) was used as 
eluent, and refractive index and UV absorbtion were monitored 

Data obtained from these analyses (Table 3) show that rubber 
extracted from these 4 species exhibit marked differences in molecular 
we.ght and molecular weight distribution (MWD), or polydispersity The 
large degree of polydispersity in the rubber of Hevea is due to the 
presence of two distinct peaks in the chromatogram, as has previously 
been observed (Subramanian, A. Gel Permeation Chromatography of 
Natural Rubber. In, Rubber Chemistry & Technology March 1972- 
PP. 346-358). In contrast, the rubbers of russian dandelion, sunflower and 
guayule are monodisperse. 

The rubber obtained from russian dandelion exhibited a higher 
weight average molecular weight (MW) than that of Hevea while 
sunflower rubber was of considerably lower molecular weight in 
accordance with previous observations (Seiler, G.J., et al., Economic 
Botany 45: 4-15 (1991)). This molecular weight of sunflower is close to the 
molecular weight desired for an 'ideal' liquid natural rubber (LNR) which 
would have the following properties (Nor, H.M., and Ebdon, J R Progress 
in Polymer Science 23: 1 43-1 77 (1 998)): 

• A weight average molecular weight (Mw) of <80,000; 

• A number average molecular weight (Mn) of <50,000; 

• A MWD (determined as Mw/Mn) of <4.0; and 

• An intrinsic viscosity (IV) of 0.2 - 0.5. 
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Table 3 

Gel Permeation Chromatography analysis of plant rubbers 



PLANT 


MW 1 


MN 2 


MWD 3 


IV 4 


SPECIES 










H. brasiliensis 


1.44 x 10 6 


252,689 


5.71 


7.35 


H. annus 


68,998 


33,134 


2.08 


0.671 


P. argentatum 


1.47 x 10 6 


641,640 


2.3 


7.719 


T. kok-saghyz 


2.18 x 10 6 


1.21 x 10 6 


1.8 


10.633 



height average molecular weight 
5 2 Number average molecular weight 
3 MW/MN 



4 lntrinsic viscosity 

As expected from previous studies, different rubbers from different 
10 species can display marked differences in their fundamental properties of 
molecular weight, polydispersity, and intrinsic velocity. These factors must 
be considered during the development of alternative commercial rubber 
sources to Hevea, and are likely to be influenced by the specific c/s- 
preny (transferase enzymes involved in their polymerization. 

15 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid molecule encoding a cis- 
prenyltransferase enzyme, selected from the group consisting of: 
5 a) an isolated nucleic acid molecule encoding the amino acid 

sequence as set forth in SEQ ID NOs:4 and 6; 

b) an isolated nucleic acid molecule that hybridizes with (a) 
under the following hybridization conditions: 0.1X SSC, 
0.1% SDS, 65°C and washed with 2X SSC, 0.1% SDS 
10 followed by 0.1X SSC, 0.1% SDS; or 

an isolated nucleic acid molecule that is complementary to (a) or (b). 

2. An isolated nucleic acid molecule as set forth in SEQ ID NOs- 3 

and 5. 

3. A polypeptide encoded by the isolated nucleic acid molecule of 
15 Claim 1. 

4. A polypeptide encoded by the isolated nucleic acid molecule of 
Claim 2. 

5. A polypeptide having the amino acid sequence selected from 
the group consisting of SEQ ID NO:4 and SEQ ID NO:6. 

6. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 301 amino acids that has at 
least 70% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:4 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence, wherein said enzyme has cis- 
prenyltransferase activity. 

7. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 168 amino acids that has at 
least 70% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:6 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence, wherein said enzyme has cis- 
prenyltransferase activity. 

8. A chimeric gene comprising the isolated nucleic acid molecule 
of Claim 1 operably linked to suitable regulatory sequences. 

9. A transformed host cell comprising the chimeric gene of 
Claim 8. 
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10. The transformed host cell of Claim 9 wherein the host cell is 
selected from the group consisting of plant cells and microbial cells. 

11. A host cell according to Claim 10 selected from the group 
consisting of russian dandelion (Taraxacum kok-saghyz), rubber tree 

5 (Hevea brasiliensis), guayule (Parfhenium argentatum), sunflower 

(Helianthus spp.), tobacco (Nicotiana spp.), tomato (Lycopersicon spp.), 
potato (Solanum spp.), hemp (Cannabis spp.), sorghum (Sorghum 
vulgare), wheat (Triticum spp.), maize (Zea mays), rice (Oryza sativa), rye 
(Secale cereale), oats (Avena spp.), barley (Hordeum vulgare), rapeseed 

10 (Brassica spp.), broad bean (Vicia faba), french bean (Phaseolus 

vulgaris), other bean species (Vigna spp.), lentil (Lens culinaris), soybean 
(Glycine max), arabidopsis (Arabidopsis thaliana), cotton (Gossypium 
hirsutum), petunia (Petunia hybrida), flax (Linum usitatissimum) and carrot 
(Daucus carota sativa). 

15 12. The transformed host cell of Claim 10 wherein the host cell is 

selected from the group consisting of Aspergillus, Saccharomyces, Pichia, 
Candida, Hansenula, Bacillus, Escherichia, Salmonella and Shigella. 

13. A method of obtaining a nucleic acid molecule encoding a cis- 
prenyltransferase enzyme comprising: 

20 a) probing a genomic library with the nucleic acid molecule of 

Claim 1; 

b) identifying a DNA clone that hybridizes with the nucleic 
acid molecule of Claim 1 ; and 

c) sequencing the genomic fragment that comprises the 
25 clone identified in step (b), 

wherein the sequenced genomic fragment encodes a cis- 
prenyltransferase enzyme. 

14. A method of obtaining a nucleic acid molecule encoding a cis- 
prenyltransferase enzyme comprising: 

30 a) synthesizing at least one oligonucleotide primer 

corresponding to a portion of the sequence selected from 
the group consisting of SEQ ID NOs:3 and 5; and 
b) amplifying an insert present in a cloning vector using the 
oligonucleotide primer of step (a); 
35 wherein the amplified insert encodes a portion of an amino acid sequence 
encoding a c/'s-prenyltransferase enzyme. 

15. The product of the method of Claims 13 or 14. 
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16. A method of altering the level of expression of a plant 
c/s-prenyltransferase protein in a host cell comprising: 

(a) transforming a host cell with the chimeric gene of Claim 8 
and; 

5 (b) growing the transformed host cell produced in step (a) 

under conditions that are suitable for expression of the 
chimeric gene resulting in production of altered levels of 
a plant c/s-prenyltransferase protein in the transformed 
host cell relative to expression levels of an untransformed 
10 host cell. 

17. A method according to Claim 1 6 wherein the method of altering 
the level of expression of a plant c/s-prenyltransferase protein in a host 
cell comprises over-expressing at least one c/s-prenyltransferase gene 
selected from the group consisting of SEQ ID NOs: 3 and 5. 

15 1 8. A method according to Claim 16 wherein the method of altering 

the level of expression of a plant c/s-prenyltransferase protein in a host 
cell comprises over-expressing the c/s-prenyltransferase gene on a 
multicopy plasmid. 

19. A method according to Claim 1 6 wherein said chimeric gene is 
20 operably linked to an inducible or regulated promoter. 

20. A method according to Claim 16 wherein chimeric gene is 
expressed in antisense orientation. 

21 . A method according to Claim 1 6 wherein said chimeric gene is 
disrupted by insertion of foreign DNA into the coding region. 

25 22. A method according to Claim 16 wherein the altering the level 

of expression of a plant c/s-prenyltransferase protein results in a 
modulation in the defense mechanism of the plant. 

23. A method for the production of natural rubber compounds 
comprising: 

30 a) providing a transformed host cell comprising: 

(i) suitable levels of isopentenyl pyrophate; and 

(ii) a c/s-prenyltransferase gene selected from the group 
consisting of SEQ ID NOs: 3 and 5, wherein said 
genes are operably linked to suitable regulatory 

35 sequences; and 

b) growing the transformed host cell of (a) under conditions 
whereby a natural rubber compound is produced. 
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24. A method for the identification of a polypeptide having c/s- 
prenyltransferase activity in a rubber-producing plant comprising: 

(a) obtaining the amino acid sequence of a polypeptide 
suspected of having c/s-prenyltransferase activity; and 

5 (b) aligning the amino acid sequence of step (a) with the 

amino acid sequence of a c/s-prenyltransferase 
consensus sequence selected from the group consisting 
of SEQ ID NO:4, 6, 8, 9, and 10, wherein the alignment 
shows the presence of conserved domains I, IV, and V- 

10 (SEQ ID NOs: 38-40). 

25. A method for the identification of a polypeptide having c/s- 
prenyltransferase activity in a rubber-producing plant comprising: 

(a) obtaining the amino acid sequence of a polypeptide 
suspected of having c/s-prenyltransferase activity; and 

15 (b) aligning the amino acid sequence of step (a) with the 

amino acid sequence of a c/s-prenyltransferase 
consensus sequence selected from the group consisting 
of SEQ ID NO:4, 6, 8, 9, and 10, wherein the alignment 
shows a sequence of at least about 50 non-conserve,d 

20 amino acids present between the absolutely conserved 

tyrosine of Domain IV and the first of the absolutely 
V9 .conserved argihine residue of Domain V. 



57 



™S PAGE BUNK (uspto) 



JO/5320 

^Br/US2003/036164 




THIS PAGE BLANK iusno> 



WO 2004/044173 



2/2 



10/532013 



7US2003/036164 




o 

CL 
CO 

"a. 
Q 




!:vJ ■ 




CM 

CD 
i — 

ZD 

O) 

Li_ 



CQ 



JC20RecUPC^ro 2 0 APR 2005. 



THIS PAGE BLANK (Msrcoy 



WO 2004/044173 

SEQUENCE LISTING 

<110> E.I. du Pont de Nemours and Company 
Hallahan, David L. 



tO/53201? 

JC20Rec#^^ PRZtl 



<120> CIS- PRENYLTRANS FERASES FROM THE RUBBER-PRODUCING PLANTS RUSSIAN DANDELION 
(TARAXACUM KOK-SAGHYZ) AND SUNFLOWER (HELIANTHUS ANNUS) 

<130> CL2039 



<160> 45 

<170> Patentin version 3.1 

<210> 1 

<211> 746 

<212> DNA 

<213> Taraxacum kok-saghyz 



<400> 1 
gatcgaaggc 


tttctgaaag 


aagttagtat 


tataaaccaa 


tatggcgtta 


gagtcttgtt 


60 


catcggtgat 


ctcgataggt 


tatatgagcc 


cgtaaggatt 


gctgctgaga 


aggccatgga 


120 


agccaccgct 


aaaaactcaa 


ccacatatct 


cctcgtatgt 


gttgcttaca 


cttcttccca 


180 


tgaaaizccca 


cgtgccatcc 


acgaagcttg 


tgaagaaagc 


atacgggtca 


tgaacggaaa 


240 


cgggtttttc 


aatggaagcg 


gatataccaa 


cgtgaatcat 


ggaagtcagg 


cggtgatcaa 


300 


agtggtggat 


cttgataagc 


atatgtacat 


gggggtggca 


ccggatcctg 


atattttagt 


360 


aaggagctcc 


ggcgaaacaa 


ggctgagcaa 


ctttctgctg 


tggcagacca 


ccaactgttt 


420 


gttgtattcc 


ccgaaagctt 


tgtggccgga 


gatggggttc 


tggcaggtgg 


tttggggaat 


480 


cttggagttt 


caaaacaatt 


ataattactt 


ggagaagaag 


aagaagcagg 


cgtaaggatg 


540 


tgttcaaaaa 


gtaaggtaat 


ctgtctttaa 


atgagtttgg 


agtgtgctgt 


gagcattaat 


600 


gggatttttc 


ttcccaatat 


gaactttcaa 


ttttgggtcg 


attataatat 


atgatccata 


660 


tgtatatgaa 


cgttgtgtga tgcattatac 


gagcagaaga 


acgttgtatt 


tttactaaaa 


720 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaa 
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<212> DNA 

<213> Taraxacum kok-saghyz 
<400> 2 

gcccttcgcg gatccagacg ctgcgtttgc tggctttgat gaaaataatc tattccacca 60 

agttatctct ctctctctct ctctctctct ctctctctct ctctctgtct gtctctcttc 120 

ctctgtctct ctagtataca attggcaaat aggattaagc cggctcattt gttaaaccaa 180 

gatgcaagtg aatccaatca ttactacaga tagttcactg aaactagtgg aagaagaaag 240 

atcaaatggt aggatcggca atttcttagg aggcttaaac gccaccttaa gaaaactcgt 300 

gtttcgtgtc attgcttctc gcccaatccc agaacacatc gccttcatcc tcgatggaaa 360 

ccgaaggttc gccaggaaat ggaacctcac agaaggcgcc ggccacaaaa ccggcttcct 420 

agcactcatg tcggtcctca aatactgcta cgagatcgga gttaagtacg tcaccatcta 480 

cgccttcagc ctcgacaatt tcaatcgacg ccctgatgaa gtccagtacg tcatggactt 540 

gatgcaagac aagatcgaag gctttctgaa agaagttagt attataaacc aatatggcgt 600 

tagagtcttg ttcatcggtg atctcgatag gttatatgag cccgtaagga ttgctgctga 660 

gaaggccatg gaagccaccg ctaaaaactc aaccacatat ctcctcgtat gtgttgctta 720 
cacttcttcc catgaaatcc cacgtgccat ccacgaagct tgtgaagaaa gcatacgggt 
catgaacg 



780 
788 



<210> 3 
<211> 906 
<212> DNA 

<213> Taraxacum kok-saghyz 
<400> 3 

atgcaagtga atccaatcat tactacagat agttcactga aactagtgga agaagaaaga 60 
tcaaatggta ggatcggcaa tttcttagga ggcttaaacg ccaccttaag aaaactcgtg 120 
tttcgtgtca ttgcttctcg cccaatccca gaacacatcg ccttcatcct cgatggaaac 180 
cgaaggttcg ccaggaaatg gaacctcaca gaaggcgccg gccacaaaac cggcttccta 240 
gcactcatgt cggtcctcaa atactgctac gagatcggag ttaagtacgt caccatctac 300 
gccttcagcc tcgacaattt caatcgacgc cctgatgaag tccagtacgt catggacttg 360 
atgcaagaca agatcgaagg ctttctgaaa gaagttagta ttataaacca atatggcgtt 420 
agagtcttgt tcatcggtga tctcgatagg ttatatgagc ccgtaaggat tgctgctgag 480 
aaggccatgg aagccaccgc taaaaactca accacatatc tcctcgtatg tgttgcttac 540 
acttcttccc atgaaatccc acgtgccatc cacgaagctt gtgaagaaag catacgggtc 600 
atgaacggaa acgggttttt caatggaagc ggatatacca acgtgaatca tggaagtcag 660 
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gcggtgatca aagtggtgga tcttgataag catatgtaca tgggggtggc accggatcct 720 

gatattttag taaggagctc cggcgaaaca aggctgagca actttctgct gtggcagacc 780 

accaactgtt tgttgtattc cccgaaagct ttgtggccgg agatggggtt ctggcaggtg 840 

gtttggggaa tcttggagtt tcaaaacaat tataattact tggagaagaa gaagaagcag 900 

gcgtaa 906 

<210> 4 
<211> 301 
<212> PRT 

<213> Taraxacum kok-saghyz 
<400> 4 

Met Gin val Asn Pro lie lie Thr Thr Asp Ser Ser Leu Lys Leu val 
15 10 15 

Glu Glu Glu Arg Ser Asn Gly Arg lie Gly Asn Phe Leu Gly Gly Leu 
20 25 30 

Asn Ala Thr Leu Arg Lys Leu val Phe Arg val lie Ala ser Arg Pro 
35 40 45 

lie Pro Glu His lie Ala Phe lie Leu Asp Gly Asn Arg Arg Phe Ala 
50 55 60 

Arg Lys Trp Asn Leu Thr Glu Gly Ala Gly His Lys Thr Gly Phe Leu 
65 70 75 80 

Ala Leu Met Ser val Leu Lys Tyr cys Tyr Glu lie Gly Val Lys Tyr 
85 90 95 

val Thr lie Tyr Ala Phe Ser Leu Asp Asn Phe Asn Arg Arg Pro Asp 
100 105 110 

Glu Val Gin Tyr Val Met Asp Leu Met Gin Asp Lys lie Glu Gly Phe 
115 120 125 

Leu Lys Glu val Ser lie lie Asn Gin Tyr Gly Val Arg val Leu Phe 
130 135 140 

lie Gly Asp Leu Asp Arg Leu Tyr Glu Pro val Arg lie Ala Ala Glu 
145 150 155 160 

Lys Ala Met Glu Ala Thr Ala Lys Asn ser Thr Thr Tyr Leu Leu val 
165 170 ~ 175 

cys val Ala Tyr Thr ser Ser His Glu lie Pro Arg Ala lie His Glu 
180 185 190 
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Ala Cys Glu Glu ser He Arg Val Met Asn Gly Asn Gly Phe Phe Asn 
D 200 205 

Gly ser Gly Tyr Thr Asn Val Asn His Gly Ser Gin Ala Val He Lys 

ZJ --> 220 

val val Asp Leu Asp Lys His Met Tyr Met Gly Val Ala Pro Asp Pro 

" 235 240 

Asp He Leu val Arg ser Ser Gly G l u Thr Arg Leu ser Asn Phe Leu 

250 255 

Leu Tr P Gin Thr Thr Asn cys Leu Leu Tyr Ser Pro Lys Ala Leu Trp 
^ DU 265 270 

Pro Glu Met Gly Phe Trp Gin Val val Trp Gly He Leu Glu Phe Gin 
3 280 285 

Asn Asn Tyr Asn Tyr Leu Glu Lys Lys .Lys Lys Gin Ala 
^ 295 300 

<210> 5 

<211> 505 

<212> DNA 

<213> Helianthus annus 



<210> 6 

<211> 168 

<212> prt 

<213> Helianthus annus 



60 
120 
180 



<400> 5 

accagaaaga ttgaagggtt tatgaaagaa ttaacaattg tgaacaggta tggtgttaga 
gtcttgttta tcggcgatct taaaaggtta tacgagcccg ttagagttgc agccgagaaa 
gcaatggagg ccactgctaa caacacacat acatatcttt tagtatgtgt tgcttacact 
tcttcacacg aaatcccgcg tgccgtttat gaatcttgcg aagaaaagag tggtggaacc 240 
ggagttatga ttaatggaaa tggaagtgtg aacggagatt acagtgaaga aaagagtggt 300 
ggaaccggag ttatggtgaa tggaaatggg agtgtgaatg gagattacag taatggagat 360 
catgaggagg gggttaaagt ggtggatatt gacaaacata tgtatatggc agtggctcct 420 
gatcctgata ttttggtcag gagctcaggg gagacgaggt tgagtaactt tttgctgtgg 480 
caaaccacca actgcgtgtt gtatt 



505 
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<400> 6 



Thr Arg Lys lie Glu Gly Phe Met Lys Glu Leu Thr lie val Asn Arg 
15 10 15 

Tyr Gly val Arg Val Leu Phe lie Gly Asp Leu Lys Arg Leu Tyr Glu 
20 25 30 

Pro Val Arg Val Ala Ala Glu Lys Ala Met Glu Ala Thr Ala Asn Asn 
35 40 45 

Thr His Thr Tyr Leu Leu Val Cys Val Ala Tyr Thr Ser Ser His Glu 
50 55 60 

lie Pro Arg Ala Val Tyr Glu Ser Cys Glu Glu Lys Ser Gly Gly Thr 
65 ~ 70 ' 75 80 

Gly val Met lie Asn Gly Asn Gly Ser Val Asn Gly Asp Tyr ser Glu 
85 90 95 

Glu Lys Ser Gly Gly Thr Gly Val Met Val Asn Gly Asn Gly ser val 
100 105 110 

Asn Gly Asp Tyr Ser Asn Gly Asp His Glu Glu Gly Val Lys val val 
115 120 125 

Asp lie Asp Lys His Met Tyr Met Ala Val Ala Pro Asp Pro Asp lie 
130 135 140 

Leu Val Arg Ser Ser Gly Glu Thr Arg Leu Ser Asn Phe Leu Leu Trp 
145 150 ~ 155 160 

Gin Thr Thr Asn Cys Val Leu Tyr 
165 

<210> 7 
<211> 228 
<212> PRT , 

<213> Calendula officinalis 



<400> 7 

Met Pro Lys His Val Ala Phe lie Met Asp Gly Asn Arg Arg Trp Ala 
15 10 15 

Val Glu Lys Gly Trp Ser Pro Met Thr Gly His Ser Ala Met Arg Lys 
20 25 30 

Thr Leu Gin Ser Leu Leu Phe Arg cys ser Lys Phe Lys lie Lys Ala 
35 40 45 
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val Ser He Tyr Ala Phe Ser Thr Glu Asn Trp Thr Arg Pro Ly s Glu 

Glu val Asp Phe Leu Met Glu Met Tyr Glu Asp Leu Leu Arg Thr Asp 

75 80 

Ala Glu Glu Leu Leu ser Leu Gly cys Arg Val Ser He Met Gly L ys 

Lys Thr Asn Leu Pro Lys Ser Leu Gin Lys Leu Cys He Glu lie Glu 

105 HO 

Glu Lys ser Arg Ala Asn ser Gly Thr His val Asn Tyr Ala Leu Asn 

Tyr ser Gly Lys Tyr Asp lie He Glu Ala Cys Lys ser Val Ala Thr 

xjd 140 

Lys Val Lys Asp Gly Val He He Pro Lys Gin He Asp Glu Lys Tyr 

155 160 

Phe Lys Gin Glu Leu Gly Thr Lys Met lie Asp Phe Pro Tyr Pro Asp 

170 175 

Leu val He Arg Thr Ser Gly Glu lie Arg Leu ser Asn Phe Met Leu 

185 190 



Pro 

205 



Tr P Gin Met Ala Tyr Ser Glu Leu Tyr Phe Thr Asp Lys Tyr Phe 
Asp Phe Gly Glu Asn Asp Leu He Glu Ala Leu Leu Ala Phe Gin Lys 



225 CyS 



<210> 8 
<211> 290 
<212> PRT 

<213> Hevea brasiliensis 
<400> 8 

Met Glu Leu Tyr Asn Gly Glu Arg Pro Ser Val Phe Arg Leu Leu Gly 

10 15 

Lys Tyr Met Arg Lys Gly Leu Tyr ser lie Leu Thr Gin Gly Pro lie 

" 30 
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pro Thr His lie Ala Phe lie Leu Asp Gly Asn Arg Arg Phe Ala Lys 
35 40 45 

Lys His Lys Leu Pro Glu Gly Gly Gly His Lys Ala Gly Phe Leu Ala 
50 55 60 

Leu Leu Asn val Leu Thr Tyr cys Tyr Glu Leu Gly val Lys Tyr Ala 
65 70 75 80 

Thr lie Tyr Ala Phe Ser lie Asp Asn Phe Arg Arg Lys Pro His Glu 
85 90 95 

val Gin Tyr val Met Asp Leu Met Leu Glu Lys lie Glu Gly Met lie 
100 105 110 

Met Glu Glu ser lie lie Asn Ala Tyr Asp lie Cys val Arg Phe val 
115 120 125 

Gly Asn Leu Lys Leu Leu ser Glu Pro val Lys Thr Ala Ala Asp Lys 
130 135 140 

lie Met Arg Ala Thr Ala Asn Asn ser Lys Cys Val Leu Leu lie Ala 
145 150 155 160 

val cys Tyr Thr ser Thr Asp. Glu lie val His Ala val Glu Glu ser 
165 170 175 

ser Glu Leu Asn Ser Asn Glu val cys Asn Asn Gin Glu Leu Glu Glu 
180 185 190 

Ala Asn Ala Thr Gly Ser Ser Thr Val lie Gin Thr Glu Asn Met Glu 
195 200 205 

Ser Tyr Ser Gly lie Lys Leu val Asp Leu Glu Lys Asn Thr Tyr lie 
210 215 220 

Asn Pro Tyr Pro Asp val Leu lie Arg Thr Ser Gly Glu Thr Arg Leu 
225 ' 230 235 240 

Ser Asn Tyr Leu Leu Trp Gin Thr Thr Asn cys lie Leu Tyr Ser Pro 
245 250 255 

Tyr Ala Leu Trp Pro Glu lie Gly Leu Arg His val val Trp ser val 
260 265 270 

lie Asn Phe Gin Arg His Tyr ser Tyr Leu Glu Lys His Lys Glu Tyr 
275 .280 285 

Leu Lys 
290 
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<210> 9 

<211> 290 

<212> PRT 

<213> Hevea brasiliensis 

<400> 9 

Met Glu Leu Tyr Asn Gly Glu Arg Pro Ser Val phe Arg Leu Leu Glu 
1 5 10 15 

Lys Tyr Met Arg L ys Gly Leu Tyr ser He Leu Thr Gin Gly Pro He 
zu 25 30 

Pro Thr His He Ala Phe lie Leu Asp Gly Asn Arg Arg Phe Ala Lys 

40 45 J 

Lys His Lys Leu Pro Glu Gly Gly Gly His Lys Ala Gly Phe Leu Ala 
- >u 5 5 60 

Leu Leu Asn Val Leu Thr Tyr cys Tyr Glu Leu Gly Val Lys Tyr Ala 

70 75 80 

Thr lie Tyr Ala Phe Ser lie Asp Asn Phe Arg Arg Lys Pro His Glu 
85 90 95 

"val Gin Tyr Val Met Asp Leu Met Leu Glu Lys lie Glu Gly Met He 
100 105 110 

Met Glu Glu ser He lie Asn Ala Tyr Asp lie Cys val Arg Phe Val 
J -- L - > 120 125 

Gly Asn Leu Lys Leu Leu Ser Glu pro val Lys Thr Ala Ala Asp Lys 
XiV 135 140 

lie Met Arg Ala Thr Ala Asn Asn ser Lys cys Val Leu Leu lie Ala 
±n 150 155 160 

Val cys Tyr Thr Ser Thr Asp Glu He Val His Ala Val Glu Glu Ser 
165 170 175 

ser Glu Leu Asn Ser Asn Glu Val cys Asn Asn Gin Glu Leu Glu Glu 
180 185 190 

Ala Asn Ala Thr Gly ser Ser Thr val He Gin Thr Glu Asn Met Glu 
xy:> 200 205 

ser Tyr ser Gly He Lys Leu Val Asp Leu Glu Lys Asn Thr Tyr He 
* xu 215 220 

Asn pro Tyr Pro Asp val Leu lie Arg Thr Ser Gly Glu Thr Arg Leu 
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225 230 235 240 

Ser Asn Tyr Leu Leu Trp Gin Thr Thr Asn cys He Leu Tyr ser Pro 
245 250 255 

Tyr Ala Leu Trp Pro Glu lie Gly Leu Arg His val Val Trp Ser val 
260. 265 270 

He Asn Phe Gin Arg His Tyr Ser Tyr Leu Glu Lys His Lys Glu Tyr 
275 280 285 

Leu Lys 
290 

<210> 10 

<211> 296 

<212> PRT 

<213> Hevea brasiliensis 

<400> 10 

Met Glu lie Tyr Thr Gly Gin Arg Pro Ser val Phe Arg lie Phe Gly 
1 5 10 15 

Lys Tyr Met Arg Lys Gly Leu Tyr ser lie Leu Thr Gin Gly Pro lie 
20 25 30 

Pro Thr His Leu Ala Phe lie Met Asp Gly Asn Arg Arg Phe Ala Lys 
35 40 45 

Lys His Lys Met Lys Glu Ala Glu Gly Tyr Lys Ala Gly Tyr Leu Ala 
50 55 60 

Leu Leu Arg Thr Leu Thr Tyr Cys Tyr Glu Leu Gly val Arg Tyr Val 
65 70 75 80 

Thr lie Tyr Ala Phe Ser lie Asp Asn Phe Arg Arg Gin Pro Arg Glu 
85 90 95 

val Gin Cys val Met Asn Leu Met Met Glu Lys lie Glu Glu lie lie 
100 105 110 

val Glu Glu ser lie Met Asn Ala Tyr Asp Val Gly val Arg lie val 
115 120 125 

Gly Asn Leu Asn Leu Leu Asp Glu Pro lie Arg lie Ala Ala Glu Lys 
130 135 140 

lie Met Arg Ala Thr Ala Asn Asn Ser Gly Phe val Leu Leu lie Ala 
145 " 150 155 160.- 
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Val Ala Tyr Ser Ser Thr Asp Glu n e Gly His Ala Val Glu Glu Ser 
165. 170 175 

ser Lys Asp Lys Leu Asn ser Asn Glu Val cys Asn Asn Gly He Glu 
18 0 185 190 

Ala Glu Gin Glu Phe Lys Glu Ala Asn Gly Thr Gly Asn ser Val He 
195 200 205 

Pro Val Gin Lys Thr Glu Ser Tyr ser Gly lie Asn Leu Ala Asp Leu 
z±u 215 220 

Glu Lys Asn Thr Tyr Val Asn Pro His Pro Asp val Leu lie Arg Thr 
" 1 230 235 240 

ser Gly Leu Ser Arg Leu ser Asn Tyr Leu Leu Trp Gin Thr ser Asn 
Z4 5 250 255 

Cys lie Leu Tyr Ser Pro Phe Ala Leu Trp Pro Glu He Gly Leu Arg 
260 265 270 

His Leu Val Trp Thr Val Met Asn Phe Gin Arg His His Ser Tyr Leu 
275 280 ~ 285 

Glu Lys His Lys Glu Tyr Leu Lys 
290 295 

<210> 11 

<211> 309 

<212> PRT 

<213> Vitis sp. 



<400> 11 

Met Leu ser Phe Arg Phe Pro He Ser Ala Asp Asn Ala Arg His Thr 
5 10 15 

Phe Lys Ser Lys His Ser ser cys Thr Phe Arg ser Asn Arg He Asp 
20 25 30 

ser Phe Ser Phe Pro Pro He ser Val Pro Arg Phe His Lys Leu Arg 
3 5 40 45 

Thr Ala Lys Thr Asp Val val Gly Glu Glu Glu Ala Arg Glu Val Asn 
50 55 60 

Glu Arg Ala Glu Glu Phe pro Asp Gly Leu Arg Arg Glu Leu Met Pro 

/0 75 80 
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Glu His val Ala val lie Met Asp Gly Asn val Arg Trp Ala Gin Lys 
85 90 95 

Arg* Gly Leu Pro Ala Ala Ser Gly His Gin Ala Gly Val Arg Ser Leu 
100 105 11U 

Arg Glu Leu Val Glu Leu cys cys Lys Trp Gly lie Lys Val Leu Ser 
115 120 1" 

val Phe'Ala Phe Ser Tyr Asp Asn Trp ser Arg Ser Glu Gly Glu val 
130 135 140 

Gly Phe Leu Met ser Leu lie Glu Arg val val Lys Ala Glu Leu Pro 
145 150 155 160 

lie Leu Gly Gly Lys Ala Phe Glu cys Arg Asp Trp Gly Phe val Lys 
165 170 175 

Ala ser Glu Gin Leu Gin Leu He He Asp val Glu Glu Thr Thr Lys 
180 185 190 

Glu Asn ser Arg Leu Gin Phe lie val Ala Leu Ser Tyr ser Gly Gin 
195 200 205 

Cys Asp lie Leu Gin Ala Cys Lys Asn lie Gly His Lys val Lys Asp 
210 215 220 

Gly Leu lie Glu Pro Glu Asp He Asn Lys Ser Leu He Glu Gin Glu 
225 230 235 240 

Leu Gin Thr Asn cys Thr Glu Phe Pro Phe Pro Asp Leu Leu lie Arg 
245 250 255 

Thr Ser Gly Glu Leu Arg val Ser Asn Phe Met Leu Trp Gin lie Ala 
260 265 270 

Tvr Thr Glu Leu Cys Phe Phe ser Thr Leu Trp Pro Asp Phe Gly Lys 
275 280 285 

asd Glu Phe val Glu Ala Leu Ser Ser phe Gin Lys Arg Gin Arg Arg 
V 290 295 300 

Tyr Gly Gly Arg Asn 
305 

<210> 12 

<211> 252 

<212> PRT 

<213> oryza sativa 
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<400> 12 



Met Pro Lys His il e Ala Phe He Met Asp Gly A sn Arg Arg Tyr Ala 

Lys Phe Arg ser He Gin Glu Gly ser Gly His Arg Val Gly Phe ser 

25 30 

Ala Leu lie Ala ser Leu Leu Tyr Cys Tyr Glu Met Gly Val Lys Tyr 



45 



He Thr Val Tyr Ala Phe Ser He Asp Asn Phe Lys Arg Asp Pro Thr 



60 



Glu val Lys ser Leu Met Glu Leu Met Glu Glu Lys He Asn Glu Leu 

75 80 

Leu Glu Asn Arg Asn Val He Asn Lys Val Asn cys Lys He Asn Phe 

90 95 

Trp Gly Asn Leu Asp Met Leu ser Lys Ser Val Arg.val Ala Ala Glu 

105 110 

Lys Leu Met Ala Thr Thr Ala Glu Asn Thr Gly Leu Val Phe ser val 
cys Met Pro Tyr Asn Ser Thr Ser Glu He val Asn Ala Val Asn Lys 
val cys Ala Glu Arg Arg Asp He Leu Gin Arg Glu Asp Ala Asp ser 



160 



val Ala Asn Asn Gly val Tyr ser Asp lie ser val Ala Asp Leu Asp 

Arg His Met Tyr ser Ala Gly cys Pro Asp Pro Asp He val He Arg 

Thr ser Gly Glu Thr Arg Leu ser Asn Phe Leu Leu Trp Gin Thr Thr 

Phe ser His Leu Gin Asn Pro Asp Pro Leu Trp Pro Glu Phe ser Phe 

^x^ 220 

Lys His Leu val Trp Ala lie Leu Gin Tyr Gin Arg Val His Pro Ser 

235 240 

He Glu Gin ser Arg Asn Leu Ala Lys Lys Gin Leu 
245 250 

<210> 13 

Page 12 



WO 2004/044173 Jg^ (&^T/US:?0ILV0361i>4 

<211> 299 
<212> PRT 
<213> Oryza sativa 

<400> 13 

Met Leu Gly Ser Leu Met Ser Tyr Leu Pro Ser val Asp Ser Lys Thr 
1 5 10 15 

Glu Asn Thr Asp G~lu Leu lie Ala Thr Gly Val Leu Ala Ser Leu Gin 
20 25 30 

Asn Phe lie Arg Lys Cys lie Val Ala val Leu ser Tyr Gly Pro Met 
35 40 45 

pro Lys His lie Ala Phe lie Met Asp Gly Asn Arg Arg Tyr Ala Lys 
50 55 60 

Phe Arg Ser lie Gin Glu Gly Ser Gly His Arg Val Gly Phe Ser Ala 
65 70 75 80 

Leu lie Ala Ser Leu Leu Tyr cys Tyr Glu Met Gly val Lys Tyr lie 
85 90 95 

Thr Val Tyr Ala Phe Ser lie Asp Asn Phe Lys Arg Asp Pro Thr Glu 
100 105 110 

val Lys Ser Leu Met Glu Leu Met Glu Glu Lys lie Asn Glu Leu Leu 
115 120 125 

Glu Asn Arg Asn val lie Asn Lys val Asn Cys Lys lie Asn Phe Trp 
130 135 140 

Gly Asn Leu Asp Met Leu Ser Lys ser val Arg val Ala Ala Glu Lys 
145 150 155 160 

Leu. Met Ala Thr Thr Ala Glu Asn Thr Gily Leu Val Phe Ser Val Cys 
165 170 175 

Met pro Tyr Asn ser Thr ser Glu lie Val Asn Ala val Asn Lys val 
180 185 190 

cys Ala Glu Arg Arg Asp lie Leu Gin Arg Glu Asp Ala Asp Ser Val 
195 200 205 

Ala Asn Asn Gly val Tyr Ser Asp lie Ser Val Ala Asp Leu Asp Arg 
210 215 220 

His Met Tyr Ser Ala Gly Cys Pro Asp Pro Asp lie Val He Arg Thr 
225 230 235 240 
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# 



Ser Gly Glu Thr Arg Leu ser Asn Phe Leu Leu Trp Gin Thr Thr Phe 
£H:> 250 255 

ser His Leu Gin Asn Pro Asp Pro Leu Trp Pro Glu Phe Ser Phe Lys 
^ DU 265 * 270 

His Leu val xrp Ala He Leu Gin Tyr Gin Arg val His Pro ser lie 



285 



Glu Gin ser Arg Asn Leu Ala Lys Lys Gin Leu 



<210> 14 

<211> 322 

<212> PRT 

<213> Glycine max 



<400> 14 

Met Phe ser Leu Arg Leu Pro He Pro Leu val Lys Thr Pro Pro Ser 

10 15 

Pro ser Cys Tyr Tyr ser His Tyr Tyr His Tyr Arg Tyr Arg T yr Arg 

cys Tyr His Pro Phe His His Arg ser Gin Thr Gin Ser Leu He Val 

4U 45 

Ser Lys Arg Gly Ser Ala lie Ala Lys Cys His Ala Asp Ser val Thr 

>b 60 

Leu Arg Asp Asp Gly val ser Leu Ala Gin Glu ser Leu Glu Pro Leu 

75 80 

Pro Ala Glu Leu Ala Ala Glu Met Met Pro Lys His Val Ala Val He 

90 95 

Met Asp Gly Asn Gly Arg Trp Ala Lys Val Lys Gly Leu Pro Pro ser 

105 110 

Ala Gly His Gin Ala Gly val Gin ser Leu Arg L ys Met Val Arg Leu 

Cys cys ser Trp Gly He Lys Val Leu Thr Val Phe Ala Phe Ser Thr 

!40 

145 TrP Val Ar9 gg Lvs val G ^ val Asp Phe Leu Met Arg Leu 

155 160 
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Phe Glu Arg Thr lie Asn Ser Glu val Gin Thr Phe Lys Arg Glu 
165 170 175 



lie Arg He Ser val lie Gly Asp ser Ser Arg Leu Pro Glu Ser Leu 
180 185 190 



Lys Arg Met lie Ala ser Ala Glu Glu Asp Thr Lys Gin Asn Ser 
195 200 205 



Phe Gin Leu lie val Ala val Gly Tyr Ser Gly Lys Tyr Asp Val val 
210 215 ' 220 



Gin Ala Cys Lys ser Val Ala Lys Lys val Lys Asp Gly His He His 
225 230 235 ~ 240 



Leu Asp Asp lie Asn Glu Asn lie lie Glu Gin Glu Leu Glu Thr Asn 



Cys Thr Glu Phe Pro Tyr Pro Asp Leu Leu lie Arg Thr Ser Gly Glu 
260 265 ~ 270 

Leu Arg Val Ser Asn Phe Leu Leu Trp Gin Leu Ala Tyr Thr Glu Leu 
275 280 285 

Tyr Phe Asn Arg Glu Leu Trp Pro Asp Phe Gly Lys Asp Glu Phe Val 
290 295 300 

Asp Ala Leu Ser ser Phe Gin Gin Arg Gin Arg Arg Tyr Gly Gly Arg 
305 310 315 320 

His Ser 

<210> 15 

<211> 266 

<212> PRT 

<213> Triticum aestivum 

<400> 15 

Met Pro Leu ser Asn Ser Thr ser Ser val Pro Ala val Thr Val Pro 
1 5 10 . 15 

Ala Ala Glu Glu Leu Leu Ser Gin Gly Leu Arg Ala Glu ser Leu Pro 
20 25 30 

Arg His val Ala Leu Val Met Asp Gly Asn Ser Arg Trp Ala Ala Ala 
35 40 45 
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Arg Gly Leu Pro Pro Thr Asp Gly His Glu His Gly Met Arg Ala Leu 

Met Arg Thr Val Arg Leu Ser Arg Ala Trp Gly He Arg Val Leu Thr 

75 80 

Ala Phe Gly Phe Ser Leu Glu Asn Trp Asn Arg Pro Lys Ala Glu val 
Bi 90 95 

Asp Phe Leu Met Ala Leu lie Glu Arg Phe He Asn Asp Asn Leu Ala 
xuu 105 HO 

Glu Phe Leu Arg Glu Gly Thr Arg Leu Arg He He Gly Asp Arg Ser 

Arg Leu Pro He ser Val Gin Lys Thr Ala Arg Asp Ala Glu Glu Ala 

140 

Thr Arg Asn Asn Ser Gin Leu Asp Leu val Leu Ala He Ser Tyr ser 

155 160 

Gly Arg Met Asp lie val Gin Ala cys Arg Asn Leu Ala Gin Lys Val 

170 175 

Asp Ala Lys Leu Leu Arg Pro Glu Asp He Asp Glu ser Leu Phe Ala 
u 185 190 

Asp Glu Leu Gin Thr ser Glu Thr ser cys Pro Asp Leu Leu He Arg 
- L:7 - J 200 205 

Thr ser Gly Glu Leu Arg Leu Ser Asn Phe Leu Leu Trp Gin Ser Ala 

215 220 

Tyr ser Glu Leu Phe Phe Thr Asp Thr Leu Trp Pro Asp Phe Gly Glu 

" u 235 240 

Ala Gin Tyr Leu Gin Ala Met Met Ala Phe Gin Ser Arg Asp Arg Arg 
^ 250 255 



Phe Gly Arg Arg Lys Asn Asn Ala Ala Leu 
260 ?aK 



265 

<210> 16 
<211> 287 
<212> PRT 

<213> Dimorphotheca sinuata 
<400> 16 

Met Leu Asn Leu Pro Leu Tyr Leu Pro Lys Tyr Pro cys Tyr Phe Pro 
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15 10 15 



Ala Ser Leu Ser Thr Asn His His Arg Gly Leu Tyr Val Phe Asn Gin 
20 25 30 

ser Asp Thr Thr Gly Gly Gly lie Asn Ser Leu Glu Glu Arg lie Thr 
35 40 45 

Pro Ala Gly Leu Lys His Glu Leu Met Pro Lys His Val Ala val lie 
50 55 60 

Met Asp Gly Asn Arg Arg Trp Ala Arg Ser Arg Gly Leu Met Pro Asp 
65 70 75 80 

Ala Gly Tyr Met Glu Gly Ala Arg Ser Leu Lys Val Met Val Glu Leu 

85 . 90 95 

Cys Arg Lys Trp Gly lie Gin val Leu Thr val Phe Ala Phe ser Ala 
100 105 110 

Asp Asn Trp Leu Arg Pro Lys val Glu val Asp Phe Leu Met Gly Leu 
115 120 125 

lie Glu ser val Leu Lys Asp Glu Val val His Met lie Lys Glu Gly 
130 135 140 

lie Gin Leu ser Val lie Gly Asp Thr Ser Lys Leu Pro Lys ser val 
145 150 155 160 

Lys Arg lie lie Thr Tyr Ala Glu Asn lie Thr Lys Asn Asn ser Gin 
165 170 175 

Leu Asn Leu val val Ala lie Asn Tyr Ser Gly Lys Tyr Asp lie val 
180 185 190 

Gin Ala Cys Gin Ser He Ala Leu Lys val Lys Asp Gly Val lie Gin 
195 200 205 

Pro Glu Glu lie Asn Glu Phe Thr He Glu Asn Glu Leu Gly Thr Asn 
210 215 220 

Cys lie Pro Phe Pro His Pro Asp Leu Leu lie Arg Thr Ser Gly Glu 
225 230 235 240 

Leu Arg Val Ser Asn Phe Phe Leu Trp Gin Leu Ala Tyr Thr Glu Leu 
245 250 255 

Tyr Phe Ser Glu Thr Leu Trp pro Asp Phe Gly Glu Asp Glu Leu Leu 
260 265 270 

His Ala Leu Asn Thr Phe Gin His Arg Arg Arg Arg Tyr Gly Gly 
275 280 ~ 285 
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<210> 17 

<211> 750 

<212> DNA 

<213> Micrococcus luteus 



<300> 

<301> shimizu.N., Koyama.T. and Ogura.K. 

S^^^^^gfc"- KS-SStf UfUajag -Phosphate 

<303> j. Biol. chem. 

<304> 273 

<305> 31 

<306> 19476-19481 

<307> 1998-07-31 

<308> ABO04319 

<309> 1997-05-29 

<400> 17 

atgtttccaa ttaagaagcg aaaagcaata aaaaataata acattaatgc ggcacaaatt 60 

ccgaaacata ttgcaatcat tatggacgga aatggccgat gggcaaaaca gaaaaaaatg 120 

ccgcgcataa aaggacatta tgaaggcatg cagaccgtaa agaaaatcac aagatatgct 180 

agtgatttag gtgtaaagta cttaacgctg tacgcatttt caactgaaaa ttggtctcgt 240 

cctaaagatg aggttaatta cttgatgaaa ctaccgggtg attttttaaa cacattttta 300 

ccggaactca ttgaaaaaaa tgttaaagtt gaaacgattg gctttattga tgatttaccg 360 

gaccatacaa aaaaagcagt gttagaagcg aaagagaaaa cgaaacataa tacaggttta 420 

acgctcgtgt ttgcactgaa ttatggtggg cgtaaagaaa ttatttcagc agtgcagtta 480 

atcgcagagc gttacaaatc tggtgaaatt tctttagatg aaattagtga aactcatttt 540 

aatgaatatt tatttacagc aaatatgcct gatcctgagt tgttaatcag aacttccggt 600 

gaagaacgtt taagtaactt tttaatttgg caatgttcat atagtgagtt tgtatttata 660 

gatgaattct ggccggattt taatgaagaa agtttagcac aatgtatatc aatatatcag 720 
aatcgtcatc gacgttttgg tggattataa 



750 



<210> 18 
<211> 249 
<212> PRT 
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<213> Micrococcus luteus 

<400> 18 

Met Phe Pro lie Lys Lys Arg Lys Ala He Lys Asn Asn Asn lie Asn 
1 5 10 15 

Ala Ala Gin lie Pro Lys His lie Ala He lie Met Asp Gly Asn Gly 
20 25 30 

Arg Trp Ala Lys Gin Lys Lys Met Pro Arg lie Lys. Gly His Tyr Glu 
35 40 45 

Gly Met Gin Thr val Lys Lys lie Thr Arg Tyr Ala ser Asp Leu Gly 
50 55 60 

val Lys Tyr Leu Thr Leu Tyr Ala Phe Ser Thr Glu Asn Trp ser Arg 
65 70 ' 75 80 

pro Lys Asp Glu val Asn Tyr Leu Met Lys Leu Pro Gly Asp Phe Leu 
85 90 95 

Asn Thr Phe Leu Pro Glu Leu lie Glu Lys Asn val Lys Val Glu Thr 
100 105 110 

lie Gly Phe lie Asp Asp Leu Pro Asp His Thr Lys Lys Ala Val Leu 
115 120 125 

Glu' Ala Lys Glu Lys Thr Lys His Asn Thr Gly Leu Thr Leu val Phe 
130 135 140 

Ala Leu Asn Tyr Gly Gly Arg Lys Glu lie lie ser Ala val Gin Leii 
145 150 155 160 

lie Ala Glu Arg Tyr Lys Ser Gly Glu lie Ser Leu Asp Glu lie ser 
165 170 175 

Glu Thr His Phe Asn Glu Tyr Leu Phe Thr Ala Asn Met Pro Asp Pro 
180 185 190 

Glu Leu Leu lie Arg Thr Ser Gly Glu Glu Arg Leu Ser Asn Phe Leu 
195 200 205 

lie Trp Gin Cys ser Tyr Ser Glu Phe val Phe lie Asp Glu Phe Trp 
210 215 220 

pro Asp Phe Asn Glu Glu Ser Leu Ala Gin Cys lie ser lie Tyr Gin 
225 230 235 240 

Asn Arg His Arg Arg Phe Gly Gly Leu 
245 
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<210> 19 

<211> 861 

<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 19 

atggaaacgg atagtggtat acctggtcat tcatttgtgt taaagtggac aaaaaacatc 
ttttcgcgca cattgcgtgc atctaactgt gtacctagac atgttgggtt catcatggat 
gggaacagga gattcgctag aaagaaagag atggacgtaa aggagggcca cgaggcagga 
tttgttagta tgagtagaat cttagaactg tgttatgaag caggagtcga tacggctacc 
gtgtttgcct tttcaattga aaatttcaag aggagctcac gggaagttga atcactgatg _ 
actttagcgc gcgaaaggat acgacaaatc acagaacgtg gagagctggc ctgtaagtat 360 
ggggtacgca ttaaaattat cggcgatctc tctttgttgg ataagtctct attagaagat 420 
gttcgggttg ctgtggaaac tacaaagaac aacaaaaggg ccacgttaaa tatctgcttt 
ccatatacag gcagggaaga aatcttgcat gccatgaaag aaacaattgt tcaacataag 
aagggcgccg ctatagacga aagcacgtta gaatcgcatc tctacacggc gggggtaccc 
cctttagatt tattgattag gacaagtggc gtttccagat taagtgactt tttgatatgg 
caggcatcga gtaagggcgt acgcatcgaa ttgctggatt gtttatggcc agagtttgga 720 
cctatacgga tggcatggat tttattaaaa ttttcgtttc acaaatcctt tttaaacaaa 
gagtacagat tagaggaagg tgattatgac gaggaaacca atggggaccc catcgatttg 
aaagaaaaaa agttgaatta a 



<210> 20 
<211> 286 
<212> prt 

<213> Saccharomyces cerevisiae 
<400> 20 

Met Glu Thr Asp ser Gly He Pro Gly His Ser Phe Val Leu Lys Trp 

10 15 

Thr Lys Asn lie Phe Ser Arg Thr Leu Arg Ala Ser Asn Cys val Pro 

^ 30 

Arg His val Gly Phe He Met Asp Gly Asn Arg Arg Phe Ala Arg Lys 



Lys glu Met Asp val Lys Glu Gly His Glu Ala Gly Phe Val ser Met 

3;> 60 
Page 20 



60 
120 
180 
240 
300 



480 
540 
600 
660 



780 
840 
861 
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ser Arg He Leu Glu Leu Cys Tyr Glu Ala Gly val Asp Thr Ala Thr 



val Phe Ala Phe Ser lie Glu Asn Phe Lys Arg ser ser Arg Glu val 
85 90 95 



Glu Ser Leu Met Thr Leu Ala Arg Glu Arg lie Arg Gin lie Thr Glu 
100 105 110 



Arg Gly Glu Leu Ala Cys Lys Tyr Gly Val Arg lie Lys lie lie Gly 
115 120 125 



Asp Leu Ser Leu Leu Asp Lys Ser Leu Leu Glu Asp val Arg Val Ala 
130 . 135 140 



Val Glu Thr Thr Lys Asn Asn Lys Arg Ala Thr Leu Asn lie Cys Phe 
145 150 155 160 



pro Tyr Thr Gly Arg Glu Glu lie Leu His Ala Met Lys Glu Thr lie 
165 170 175 



val Gin His Lys Lys Gly Ala Ala lie Asp Glu ser Thr Leu Glu Ser 
180 185 190 



His Leu Tyr Thr Ala Gly Val Pro Pro Leu Asp Leu Leu lie Arg Thr 
195 200 205 



Ser Gly Val Ser Arg Leu Ser Asp Phe Leu lie Trp Gin Ala Ser ser 
210 215 220 



Lys Gly Val Arg lie Glu Leu Leu Asp Cys Leu Trp Pro Glu Phe Gly 
225 230 235 240 



Pro lie Arg Met Ala Trp lie Leu Leu Lys Phe ser Phe His Lys Ser 
245 250 255 



Phe Leu Asn Lys Glu Tyr Arg Leu Glu Glu Gly Asp Tyr Asp Glu Glu 
260 265 270 



Thr Asn Gly Asp Pro lie Asp Leu Lys Glu Lys Lys Leu Asn 
275 280 285 



<210> 21 

<211> 1032 

<212> DNA 

<213> saccharomyces cerevisiae 



<400> 21 
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atgaaaatgc ccagtattat tcagattcag tttgtagccc taaaaaggct tttggtagaa 60 

accaaagaac agatgtgctt cgcagtgaaa agtatatttc agagagtatt tgcgtgggtt 120 

atgtcattaa gcttgttttc atggttttat gtaaatcttc agaatatttt gataaaagca 180 

ttaagggtag ggccagtgcc tgaacatgtc tcctttatca tggatggtaa ccggagatat 240 

gccaagtcaa gaaggctacc agtaaaaaaa ggccatgaag ctggtgggtt aacgttacta 300 

acactactgt atatctgcaa aagattgggt gtaaaatgtg tttccgccta tgcattttct 360 

attgaaaatt ttaatagacc aaaagaagaa gtagatacgc taatgaattt gtttacggta 420 

aagcttgatg aattcgcaaa aagagccaag gactataagg atcccttata cggatctaaa 480 

ataagaatag taggtgatca atctttacta tctccagaaa tgagaaaaaa aattaaaaaa 540 

gtggaagaaa tcacacagga tggagacgat ttcactttat ttatatgttt tccttacact 600 

tcaagaaatg atatgttaca tactattcgt gattcagttg aagaccattt ggaaaataaa 660 

tcaccaagga ttaatataag aaaatttact aataaaatgt acatgggttt ccattccaat 720 

aaatgtgaat tattaatcag aacaagtggg cataggaggc tctcagacta tatgctatgg 780 

caagtacatg aaaatgccac cattgaattt agtgatacgt tgtggccaaa ttttagcttc 840 

tttgctatgt acctgatgat tctcaaatgg tccttctttt ccaccattca aaaatataat 900 

gagaagaatc actcattgtt tgaaaaaata catgaaagcg ttccttcaat atttaaaaaa 960 

aagaaaacag ctatgtcttt gtacaacttt ccaaaccccc ccatttcagt ttcggttaca 1020 

ggagatgaat aa 1Q32 

<210> 22 
<211> 343 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 22 



Leu Leu Val Glu Thr Lys Glu Gin Met Cys Phe Ala Val Lys Ser He 
20 25 30 

Phe Gin Arg Val Phe Ala Trp Val Met Ser Leu ser Leu Phe Ser Tro 
35 40 45 

Phe Tyr val Asn Leu Gin Asn lie Leu He Lys Ala Leu Arg val Gly 
50 55 60 

Pro Val pro Glu His Val Ser Phe lie Met Asp Gly Asn Arg Arg Tyr 
65 70 75 y 80 
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Ala Lys Ser Arg Arg Leu Pro val Lys Lys Gly His Glu Ala Gly Gly 
85 90 95 

Leu Thr Leu Leu Thr Leu Leu Tyr lie Cys Lys Arg Leu Gly Val Lys 
100 105 ~ 110 

cys Val Ser Ala Tyr Ala Phe Ser lie Glu Asn Phe Asn Arg Pro Lys 
115 120 125 

Glu Glu val Asp Thr Leu Met Asn Leu Phe Thr val Lys Leu Asp Glu 
130 135 140 

Phe Ala Lys Arg Ala Lys Asp Tyr Lys Asp Pro Leu Tyr Gly Ser Lys 
145 150 155 160 

lie Arg He val Gly Asp Gin ser Leu Leu Ser pro Glu Met Arg Lys 
165 170 175 

Lys lie Lys Lys Val Glu Glu lie Thr Gin Asp Gly Asp Asp Phe Thr 
180 185 190 

Leu Phe lie Cys Phe Pro Tyr Thr Ser Arg Asn Asp Met Leu His Thr 
195 200 205 

lie Arg Asp Ser val Glu Asp His Leu Glu Asn Lys Ser Pro Arg lie 
210 215 220 

Asn lie Arg Lys Phe Thr Asn Lys Met Tyr Met Gly Phe His Ser Asn 
225 " 230 235 240 

Lys cys Glu Leu Leu lie Arg Thr ser Gly His Arg Arg Leu Ser Asp 
245 250 ~ " 255 

Tyr Met Leu Trp Gin Val His Glu Asn Ala Thr lie Glu Phe Ser Asp 
260 265 270 

Thr Leu Trp Pro Asn Phe ser Phe Phe Ala Met Tyr Leu Met lie Leu 
275 280 285 

Lys Trp Ser Phe Phe Ser Thr lie Gin Lys Tyr Asn Glu Lys Asn His 
290 295 300 

ser Leu Phe Glu Lys lie His Glu ser Val Pro Ser lie Phe Lys Lys 
305 310 315 320 

Lys Lys Thr Ala Met Ser Leu Tyr Asn Phe Pro Asn Pro Pro lie Ser 
325 330 335 

Val ser val Thr Gly Asp Glu 
340 

<210> 23 
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<211> 271 
<212> PRT 
<213> Arabidopsis 



<400> 23 

Met Asn Thr Leu Glu Glu Val Asp Glu Ser Thr His He Phe Asn Ala 
1 5 10 15 

Leu Met ser Leu Met Arg Lys Phe Leu Phe Arg Val Leu Cys Val Gly 
*U 25 30 

Pro He Pro Thr Asn lie Ser Phe He Met Asp Gly Asn Arg Arg Phe 

40 45 

Ala Lys Lys His Asn Leu lie Gly Leu Asp Ala Gly His Arg Ala Gly 
>v 55 60 

Phe He Ser Val Lys Tyr lie Leu Gin Tyr Cys Lys Glu He Gly val 

/u 75 80 

Pro Tyr Val Thr Leu His Ala Phe Gly Met Asp Asn Phe Lys Arg Gly 
85 90 95 

Pro Glu Glu val Lys Cys Val Met Asp Leu Met Leu Glu Lys val Glu 
xuu 105 HO 

Leu Ala lie Asp Gin Ala val ser Gly Asn Met Asn Gly val Arg He 

120 125 

He Phe Ala Gly Asp Leu Asp ser Leu Asn Glu His Phe Arg Ala Ala 
±iv 135 140 

Thr Lys Lys Leu Met Glu Leu Thr Glu Glu Asn Arg Asp Leu He Val 

Xb ° 155 160 

Val val cys val Ala Tyr Ser Thr ser Leu Glu lie Val His Ala Val 
165 170 175 

Arg Lys Ser cys Val Arg Lys Cys Thr Asn Gly Asp Asp Leu val Leu 
J- 50 185 190 

Leu Glu Leu ser Asp val Glu Glu cys Met Tyr Thr ser lie val Pro 
xy:> 200 205 

Val Pro Asp Leu Val He Arg Thr Gly Gly Gly Asp Arg Leu Ser Asn 

^J-j 220 

Phe Met Thr Trp Gin Thr Ser Arg ser Leu Leu His Arg Thr Glu Ala 

230 235 "" 240 
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Leu Trp pro Glu Leu Gly Leu Trp His Leu val Trp Ala He Leu Lys 
245 250 255 

phe Gin Arg Met Gin Asp Tyr Leu Thr Lys Lys Lys Lys Leu Asp 
260 265 270 

<210> 24 

<211> 295 

<212> PRT 

<213> Arabidopsis 



<400> 24 

Met Ala Glu Leu Pro Gly Gin lie Arg His lie Gly Gly Arg Met ser 
1 5 10 15 

Gin Leu Leu Glu Gin lie Tyr Gly phe ser Arg Arg Ser Leu Phe Arg 
20 25 30 

Val lie Ser Met Gly Pro lie Pro Cys His lie Ala Phe lie Met Asp 
35 40 45 

Gly Asn Arg Arg Tyr Ala Lys Lys Cys Gly Leu Leu Asp Gly ser Gly 
50 . 55 60 

His Lys Ala Gly Phe Ser Ala Leu Met Ser Met Leu Gin Tyr Cys Tyr 
65 70 75 80 

Glu Leu Gly lie Lys Tyr val Thr lie Tyr Ala Phe Ser lie Asp Asn 
85 90 95 

phe Arg Arg Lys Pro Glu Glu Val Glu Ser val Met Asp Leu Met Leu 
100 105 110 

Glu Lys lie Lys Ser Leu Leu Glu Lys Glu Ser lie Val His Gin Tyr 
115 120 125 

Gly lie Arg val Tyr Phe lie Gly Asn Leu Ala Leu Leu Asn Asp Gin 
130 135 140 

Val Arg Ala Ala Ala Glu Lys val Met Lys Ala Thr Ala Lys Asn Ser 
145 150 155 160 

Arg val val Leu Leu lie cys lie Ala Tyr Asn ser Thr Asp Glu lie 
165 170 175 

Val Gin Ala val Lys Lys ser Cys lie Asn Lys ser Asp Asn lie Glu 
180 185 190 

. Page 25 



WO 2004/044173 _ , ^ 

PCTYUS2003/036164 

Ala ser Asn Tyr Lys His Glu Asp Ser Asp Ser Asp He Glu Gly Thr 

200 205 

Asp Met Glu Asn Gin Glu Lys Lys lie Gin Leu val Asp He Glu Glu 

Asn Met Gin Met ser val Ala Pro Asn Pro Asp He Leu He Arg ser 

235 240 

ser Gly Glu Thr Arg Leu ser Asn Phe Leu Leu Trp Gin Thr Gly Asn 

3 250 255 

Thr Gin Leu cys Ser Pro Ala Ala Leu Trp Pro Glu He Gly Leu Arg 
u 265 270 

His Leu Leu Trp Ala lie Leu Asn Phe Gin Arg Asn His Ser Tyr Leu 



285 



Glu kXS Ar 9 L ys Lys Gin Leu 
290 295 



<210> 25 

<211> 303 

<212> prt 

<213> Arabidopsis 

<400> 25 



Met Leu ser Leu Leu Ser ser as P ser Ser Leu Leu ser Leu Leu Phe 

10 15 

Leu Phe Leu lie Pro cys Leu Phe lie Thr Ser Tyr He Gly Phe Pro 

25 30 

val Phe Leu Leu Lys Leu He Gly L eu He Lys He Lys Ala Ala Arg 

Asp Asn Glu Lys Arg Asp Glu Gly Thr Tyr Val Val Arg Glu Asp Gly 

D:> 60 

Leu Gin Arg Glu Leu Met Pro Arg His Val Ala Phe lie Leu Asp Gly 

75 80 

Asn Arg Arg Trp Ala Lys Arg Ala Gly Leu Thr Thr Ser Gin Gly His 



Glu Ala Gly Ala Lys Arg Leu il e A sp He Ala Glu Leu cys Phe Glu 
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Leu Gly Val His Thr val Ser Ala Phe Ala Phe Ser Thr Glu Asn Trp 
115 120 125 

Gly Arg Asp Lys lie Glu lie Asp Asn Leu Met ser Leu lie Gin His 
130 135 140 

Tyr Arg Asn Lys Ser Asn lie Lys Phe Phe His Arg Ser Glu Val Arg 
145 150 155 - 160 

Val Ser val lie Gly Asn Lys Thr Lys lie Pro Glu Ser Leu Leu Lys 
165 170 175 

Glu lie His Glu lie Glu Glu Ala Thr Lys Gly Tyr Lys Asn Lys His 
180 185 190 

Leu lie Met Ala val Asp Tyr Ser Gly Lys Phe Asp lie Met His Ala 
195 200 205 

Cys Lys Ser Leu val Lys Lys Ser Glu Lys Gly Leu He Arg Glu Glu 
210 215 220 

Asp val Asp Glu Ala Leu lie Glu Arg Glu Leu Leu Thr Asn Cys Ser 
225 230 235 240 

Asp Phe Pro Ser Pro Asp Leu Met lie Arg Thr Ser Gly Glu Gin Arg 
245 250 255 

lie Ser Asn Phe Phe Leu Trp Gin Leu Ala Tyr Ser Glu Leu Phe Phe 
260 265 270 

Ser Pro val Phe Trp Pro Asp Phe Asp Lys Asp Lys Leu Leu Glu Ala 
275 280 285 

Leu Ala Ser Tyr Gin Arg Arg Glu Arg Arg Phe Gly Cys Arg Val 
290 295 300 

<210> 26 

<211> 244 

<212> PRT 

<213> Arabidopsis 



<400> 26 

Met Gly Glu Lys Gin Lys Arg Gly Arg Asn lie Met Pro Lys His val 

1 5 10 15 

Ala val lie Leu Asp Gly Asn Arg Arg Trp Ala Glu Lys Arg Gly Leu 
20 25 30 

Gly Thr Ser Glu Gly His Glu Ala Gly Ala Arg Arg Leu Met Glu Asn 
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35 



40 



45 



Ala Lys Asp cys Phe Ala Met Gly Thr Asn Thr He Ser Leu Phe Ala 
5< J 55 60 

Phe ser Thr Glu Asn Trp Glu Arg pro Glu Asp Glu val Lys cys Leu 
65 70 75 80 

Met Ala Leu Phe Glu Lys Tyr Leu Ala Ser Asp Met Pro Tyr Leu Arg 
85 90 95 

ser Asp Lys lie Lys lie Ser val lie Gly Asn Arg Thr Lys Leu Pro 
100 105 110 

Glu ser Leu Leu Gly Leu lie Glu Glu Val Glu Glu Ala Thr Lys Ser 
115 120 125 

Tyr Glu Gly Lys Asn Leu lie He Ala lie Asp Tyr ser Gly Arg Tyr 
130 135 140 

Asp He Leu Gin Ala Cys Lys Ser Leu Ala Asn Lys val Lys Asp Gly 
145 150 155 1 6 6 

Leu He Gin val Glu Asp lie Asn Glu Lys Ala Met Glu Lys Glu Leu 
165 170 175 

Leu Thr Lys cys Ser Glu Phe Pro Asn Pro Asp Leu Leu He Arg Thr 
ISO 185 190 

Ser Gly Glu Gin Arg lie Ser Asn Phe Phe Leu Trp Gin ser Ala Tyr 
195 200 205 

Thr Glu Leu Tyr Phe Pro Thr val Leu Trp pro Asp Phe Gly Glu Ala 
210 215 220 

Glu Tyr Leu Glu Ala Leu Thr Trp Tyr Gin Gin Arg Gin Arg Arg Phe 

225 230 235 240 

Gly Arg Arg val 

<210> 27 

<211> 21 

<212> PRT 

<213> Artificial sequence 



<223> Domain iv consensus sequence from Figure 1 alignment 



<220> 
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<220> 

<221> MISC_F£ATURE 

<222> (5).. (5) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (9) . . C9) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (13) . . (13) 

<223> xaa = any amino acid 

<220> 

<221> MISCFEATURE 

<222> (15) . . (15) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (16).. (16) 

<223> xaa = any amino acid 

<220> . 

<221> MISC_FEATURE 

<222> (18) . . (18) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (19) . . (19) 

<223> Xaa = any amino acid 



^^T/US2003/036164 
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<220> 

<221> MISC_FEATURE 

<222> C20) . . C20) 

<223> xaa = any amino acid 



<400> 27 

Tyr ser Gly Arg xaa Glu lie val xaa Ala val Lys xaa Sen Xaa Xaa 

10 15 



Lys Xaa xaa xaa Gly 
20 



<210> 28 

<211> 47 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Domain v consensus sequence from Figure 1 alignment 

<220> 

<221> MISC_FEATURE 

<222> (2).. (2) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (3) . . (3) 

<223> Xaa = any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (4) . . C4) 

.•<223> xaa = any amino acid 

<220> 

<221> MIS C_FE ATUR E 
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<222> (7).. (7) 

<223> Xaa = any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (8).. (8) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (9) . . (9) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (11) . . Cll) 

<223> Xaa = any amino acid 

<220> 

<221> MIS COFEATURE 

<222> C13) . . (13) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (16) . . (16) 

<223> Xaa = any amino acid 

<220> 

<221> MISC^FEATURE 

<222> (17).. (17) 

<223> Xaa = any amino acid 

.* 

<220> 

<221> MISC_FEATURE 



^^T/US2003/036 ] 64 
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<222> (19) - - (19) 

<223> xaa = any amino acid 

<220> 

<2 2 1> MIS COFEATURE 

<222> (20) . . (20) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (21) . . (21) 

<223> Xaa « any amino acid 

<220> 

<221> MI 5 COFEATURE 

<222> (22).. (22) 

<223> xaa = any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (24) . . (24) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (35).. (35) 

<223> Xaa = any amino acid 

<400> 28 

lie Xaa xaa Xaa Glu He Xaa xaa Xaa Leu xaa Asp xaa Glu Leu Xaa 
1 5 10 15 

Xaa Asn xaa xaa Xaa Xaa Pro xaa Pro Asp Leu Leu He Arg Thr ser 



25 



30 



Gly Glu xaa Arg Leu Ser Asn Phe Leu Leu Trp Gin Thr Ala Tyr 
35 40 45 
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<210> 29 

<211> 13 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> consensus Sequence from Apfel CM et al . O Bacteriol 1999 Jan: 181 
(2) : 483-492) 

<220> 

<221> MISC_FEATURE 

<222> (2). .(2) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (3).. (3) 

<223> Xaa = any amino acid 



<220> 

<221> MISC_FEATURE 

<222> C4) . .(4) 

<223> Xaa = any amino acid 

<220> 

<221> MISC__FEATURE 

<222> (5). .(5) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (12) . . (12) 

<223> xaa = w or Y or F 



<220> 
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<221> MISC_FEATURE 
<222> CIO) . . CIO) 
<223> Xaa = R or G 

<400> 29 

His Xaa xaa Xaa Xaa Met Asp Gly Asn Xaa Arg xaa Ala 
x 5 10 

<210> 30 
<211> 5 
<212> prt 

<213> Artificial Sequence 
<220> 

<223> gnjfjgfjguenc. f rom Apfel CM et al . O Bacteriol 1999 Ja n;18 
<220> 

<221> MISC_FEATURE 

<222> C3) . . C3) 

<223> xaa = any amino acid 

<220> 

<221> MISC_ FEATURE 

<222> C4);.C4) 

<223> Xaa = any amino acid 

<400> 30 

Gly His Xaa xaa Gly 
1 5 

<210> 31 

<211> 23 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> C2) n flSi-492 5 ) eqUenCe fr ° m Apfel CM et »!■ O Bacteriol 1999 Dan; 181 
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<220> 

<221> MIS COFEATURE 

<222> (1) . . CD 

<223> Xaa = T or s 

<220> 

<221> MISC__FEATURE 

<222> (2) . . C2) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (3) . . (3) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> C7) . . (7) 

<223> xaa = s or T 

<220> 

<221> MI5C_FEATURE 

<222> (10) . . CIO) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (11) . . (11) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> C13). .(13) 

<223> xaa = any amino acid 



^^T/US2003/036164 
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<220> 

<221> MI S C_ F E ATU RE 

<222> (14) . . (14) 

<223> Xaa = any amino acid 

<220> 

<221> MISC^FEATURE 

<222> (15) . . (15) 

<223> Xaa = any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (18) . . (18) 

<223> Xaa = any amino acid 

<220> 

<221> MI 5 COFEATURE 

<222> (19) . . (19) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (22). .(22) 

<223> xaa = any amino acid 



<400> 31 



xaa xaa Xaa Ala Phe Ser Xaa Glu Asn Xaa xaa Arg Xaa Xaa xaa Glu 
-> 10 



15 



Val xaa Xaa Leu Met Xaa Leu 
20 



<210> 32 

<211> 13 

<212> PRT 

<213> Artificial Sequence 
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<220> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



consensus sequence from Apfel CM et al. (J Bacterid 1999 Dan:181 
C2): 483-492) 

MISC_FEATURE 
C2) . - (2) 

Xaa = any amino acid 



MISC_FEATURE 
C3). .(3) 

Xaa sa any amino acid 



MIS C_F EATU R E 
(8) . . (8) 

Xaa = any amino acid 



MIS COFEATURE 
(9) . . (9) 
Xaa = D or E 



MI SC_F EATU R E 
(10) . . (10) 

Xaa = L or I or V or M 



MIS COFEATURE 
(11) . . (11) 

Xaa = any amino acid 



<220> 
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<221> MISCFEATURE 

<222> (12) . . (12) 

<223> Xaa = any amino acid 



<400> 32 

Ala Xaa Xaa Tyr Gly Gly Arg xaa Xaa Xaa Xaa Xaa Ala 
1 5 10 

<210> 33 

<211> 3 5 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> gjfjgf^sequence from Apfel CM et al . (3 'Bacterid 1999 Jan; 181 
<220> 

<221> MISC_FEATURE 

<222> (1) . . (1) 

<223> Xaa = D or E 



<220> 

<221> MISC_FEATURE 

<222> (3) . . (3) 

<223> xaa = any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (7) . . (7) 

<223> Xaa = S or A or G 



<220> 

<221> MISC_FEATURE 

<222> (10) . . (10) 

<223> Xaa = any amino acid 
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<220> 

<221> MIS COFEATURE 

<222> (12) . . (12) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (16).. (16) 

<223> Xaa = M or L 

<220> 

<221> MISC__FEATURE 

<222> (17) . . (17) 

<223> Xaa = L or M or p 

<220> 

<221> MISC_FEATURE 

<222> (20) . . (20) 

<223> Xaa = any amino acid 

<220> 

<221> MISC__FEATURE 

<222> (21).. (21) 

<223> xaa = any amino acid 

<220> 

<221> MI5C_FEATURE 

<222> (23). .(23) 

<223> xaa = s or A or T 

<220> 

<221> MISC_FEATURE 

<222> (25).. (25) 

<223> xaa = any amino acid 
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<220> 

<221> MISC__FEATURE 

<222> (26). .(26) 

<223> xaa = any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (28).. (28) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (29) . . (29) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (30).. (30) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (31). .(31) 

<223> Xaa = any amino acid 

<220> 

<221> MI SC_ FEATURE 

<222> (34). .(34) 

<223> xaa = D or E 



<400> 33 



xaa Leu Xaa He Arg Thr xaa Gly Glu Xaa Arg xaa Ser Asn Phe Xaa 
D 10 15 

Xaa Trp Gin xaa Xaa Tyr xaa Glu Xaa Xaa Phe Xaa Xaa xaa xaa Trp 
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20 



25 



30 



Pro xaa Phe 
35 

<210> 34 
<211> 35 
<212> PRT 



<213> Artificial Sequence 
<220> 

<223> Consensus Sequence 
<400> 34 

Asp lie Leu Val Arg Ser ser Gly Glu Thr Arg Leu Ser Asn Phe Leu 
1 5 10 15 

Leu Trp Gin Thr Thr Asn Cys val Leu Tyr Ser Pro Lys Ala Leu Trp 
20 25 30 

Pro Glu Met 
35 

<210> 35 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Artificial and non-naturally occurring peptide 
<400> 35 

Glu Leu val lie Ser Leu lie Val Glu ser 
1 5 10 

<210> 36 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



Primer NKH46 
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<400> 36 

ttcgccggag ctccttacta a 

<210> 37 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer NKH45 

<400> 37 

cgttcatgac ccgtatgctt tct 

<210> 38 

<211> 11 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus Sequence 
<220> 

<221> MI SC_ FEATURE 

<222> (4) . . (4) 

<223> xaa = L or M 

<400> 38 

Ala Phe He xaa Asp Gly Asn Arg Arg Phe Ala 
x 5 10 

<210> 39 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus Sequence 
<220> 

<221> MISCFEATURE 
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21 



23 
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<222> (2) . . C2) 
<223> Xaa = T or s 

<220> 

<221> MISC_FEATURE 

<222> (4) . . (4) 

<223> Xaa = any amino acid 

<220> 

<221> MISC^FEATURE 

<222> (5)..C5) 

<223> xaa = any amino acid 

<220> 

<221> MIS COFEATURE 

<222> (6). .(6) 

<223> xaa = D or E 

<220> 

<221> MISC_FEATURE 

<222> (8) . . (8) 

<223> Xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> C9) . . (9) 

<223> Xaa = any amino acid 

<400> 39 

Tyr xaa ser xaa Xaa Xaa lie Xaa Xaa Ala 
1 .5 10. 

<210> 40 
<211> 22 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> consensus Sequence 
<220> 

<221> MISC_FEATURE 

<222> C2) . . C2) 

<223> xaa = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> C5) . . (5) 

<223> xaa = I or v 

<220> 

<221> MISC_FEATURE 

<222> C7) . . C7) 

<223> xaa = I or v 

<220> 

<221> MISC_FEATURE 

<222> (9) . . (9) 

<223> xaa = s or T 

<220> 

<221> MISC_FEATURE 

<222> (12) . . C12) 

<223> xaa = E or L 

<220> 

<221> MISC_FEATURE 

<222> C13) . . C13) 

<223> xaa = s or T 

<220> 
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<221> MISC_FEATURE 

<222> (18) . . (IS) 

<223> Xaa = any amino acid 

<400> 40 

pro xaa Pro Asp xaa Leu xaa Arg xaa ser Gly xaa Xaa Arg Leu Ser 
1 5 10 15 

Asn xaa Leu Leu Trp Gin 
20 

<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer Dan5 

<400> 41 

ctcgacaatt tcaatcgacg cc 11 

<210> 42 

<211> 22 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer Dan6 

<400> 42 

gaaggaagtt gctcagcctt gt " 

<210> 43 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer DegHptS 
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<400> 43 

atawtggatg gaaacmggag g 21 

<210> 44 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer NKH5 
<220> 

<221> misc_feature 

<222> C6) . . (6) 

<223> n = inosine 

<220> 

<221> mis cofeature 

<222> C8)..(8) 

<223> n = inosine 



<400> 44 

tgccananha dgwarttrct 20 

<210> 45 
<211> 208 
<212> PRT 

<213> Taraxacum kok-saghyz 
<400> 45 

Leu val Phe He Leu Asp Gly Asn Arg Arg Phe Ala Arg Lys Trp Asn 
15 10 15 

Leu Thr Glu Gly Ala Gly His Lys Thr Gly phe Leu Ala Leu Met ser 
20 25 30 

val Leu Lys Tyr cys Tyr Glu lie Gly Val Lys Tyr Val Thr He Tyr 
35 40 45 

Ala Phe Ser Leu Asp Asn phe Asn Arg Arg Pro Asp Glu Val Gin Tyr 
50 55 60 
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val Met Asp Leu Met Gin Asp Lys lie Glu Gly phe Leu Lys Glu val 
65 70 75 80 



ser lie lie Asn Gin Tyr Gly val Arg Val Leu Phe lie Gly Asp Leu 
85 90 95 

Asp Arg Leu Tyr Glu Pro Val Arg lie Ala Ala Glu Lys Ala Met Glu 
100 105 110 

Ala Thr Ala Lys Asn Ser Thr Thr Tyr Leu Leu Val cys Val Ala Tvr 
115 120 125 

Thr ser ser His Glu lie Pro Arg Ala lie His Glu Ala cys Glu Glu 
130 135 140 

Ser lie Arg val Met Asn Gly Asn Gly Phe Phe Asn Gly Ser Gly Tvr 
145 150 155 160 

Thr Asn val Asn His Gly Ser Gin Ala Val lie Lys Val val Asp Leu 
165 170 175 

Asp Lys His Met Tyr Met Gly Val Ala Pro Asp Pro Asp lie Leu val 
180 185 190 

Arg Ser Ser Gly Glu Thr Arg Leu ser Asn Phe Leu Leu Trp His Lys 



195 



200 



205 
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